BLAST Basic Local Alignment Search Tool

Job Title: M18930:Human hepsin mRNA, complete cds 



BLASTN 2.2.18+ 

RID: 36C7FNM2013 Database: All GenBank+EMBL+DDBJ+PDB sequences (but no EST, STS, GSS, environmental
samples or phase 0, 1 or 2 HTGS sequences) 6,839,787 sequences; 23,768,953,950 total letters

Query= gi|184371|gb|M18930.1|HUMHPSNA Human hepsin mRNA, complete cds. Length=1783



Distribution of 106 Blast Hits on the Query Sequence 
Alignments



>ref|NM_002151.1| Homo sapiens hepsin (transmembrane protease,
transcript variant 2, mRNA
gb|M18930.1|HUMHPSNA Human hepsin mRNA, complete cds



Score = 3293 bits 
Identities = 1783 
Strand=Plus/Plus 



Sbjct 



TGCCCAGGCCTGGAGACTGACCCGACCCCGGCACTACCTCGAGGCTCCGCCCCCACCTGC 180

TGCCCAGGCCTGGAGACTGACCCGACCCCGGCACTACCTCGAGGCTCCGCCCCCACCTGC 180

Query 181 TGGACCCCAGGGTCCCACCCTGGCCCAGGAGGTCAGCCAGGGAATCATTAACAAGAGGCA 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 181 TGGACCCCAGGGTCCCACCCTGGCCCAGGAGGTCAGCCAGGGAATCATTAACAAGAGGCA 240

Query 241 GTGACATGGCGCAGAAGGAGGGTGGCCGGACTGTGCCATGCTGCTCCAGACCCAAGGTGG 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 241 GTGACATGGCGCAGAAGGAGGGTGGCCGGACTGTGCCATGCTGCTCCAGACCCAAGGTGG 300

Query 301 CAGCTCTCACTGCGGGGACCCTGCTACTTCTGACAGCCATCGGGGCGGCATCCTGGGCCA 360 

Sbjct 301 CAGc!c!cAc!gCGGG^ 3.0 

Query 361 TTGTGGCTGTTCTCCTCAGGAGTGACCAGGAGCCGCTGTACCCAGTGCAGGTCAGCTCTG 420 

Sbjct 361 TTGTGGCTGTTCTCCTCAGGAG^ 420 

Query 421 CGGACGCTCGGCTCATGGTCTTTGACAAGACGGAAGGGACGTGGCGGCTGCTGTGCTCCT 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 421 CGGACGCTCGGCTCATGGTCTTTGACAAGACGGAAGGGACGTGGCGGCTGCTGTGCTCCT 480

Query 481 CGCGCTCCAACGCCAGGGTAGCCGGACTCAGCTGCGAGGAGATGGGCTTCCTCAGGGCAC 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 481 CGCGCTCCAACGCCAGGGTAGCCGGACTCAGCTGCGAGGAGATGGGCTTCCTCAGGGCAC 540
Query 541 
Sbjct 



541 

Query 601 

Sbjct 601 

Query 661 

Sbjct 661 

Sbjct 721 

Query 781 

Sbjct 781 

Query 841 



jGCACGTCGGGCTTCTTCT 60 0 

TGACCCACTCCGAGCTGGACGTGCG 600 

GTGTGGACGAGGGGAGGCTGCCCCACACCCAGAGGCTGCTGGAGGTCATCTCCGTGTGTG 660

GTGTGGACGAGGGGAGGCT 660 



GCCTTCGCTATGATGGAGCACACCTCTGTGGG3 i l 1 l 3 3GGACTGGGTGC 840 

NNIMMIIIIIMI^ 84o 



TGACAGCCGCCCACTGCTTCCCGG- 3 T I [ r T jTTTG 90 0 

90 o 

CCGGTGCCGTGGCCCAGGCCTCTCCCCACGGTCTGCAGCTGGGGGTGCAGGCTGTGGTCT 960

MMMWMimMMm^ 960 

ACCACGGGGGCTATCTTCCCTTTCGGGACCCCAACAGCGAGGAGAACAGCAACGATATTG 1020
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
ACCACGGGGGCTATCTTCCCTTTCGGGACCCCAACAGCGAGGAGAACAGCAACGATATTG 1020



M M M M M M M M M M M M M M M I 



M M M M M M M I 
Sbjct



141 CGCAGTACTATGGCCAACAGGCCGGGGTACTCCAGGAGGCTCGAGTCCCCATAATCAGCA 
2 01 ATGATGTCTGCAATGGCGCTGAt 



1321 
1321 
1381 



GTGAGGACAGCATCTCTCGGACGCCACGTIO.: "G ?T 3T 3T^G1' VfT :-_-_3TTGGGGCA 



CTGGCTGTGCI 



Sbjct 
Sbjct 
Sbjct 



:CCTGGCCCAGAAGCCAGGC( 
Query 1441 GGATCTTCCAGGCCATAAAGACTCACTCCGAAGCCAGCGGCATGGTGACCCAGCTCTGAC
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 1441 GGATCTTCCAGGCCATAAAGACTCACTCCGAAGCCAGCGGCATGGTGACCCAGCTCTGAC
Query 1501 CGGTGGCTTCTCGCTGCGCAGCCTCCAGGGCCCGAGGTGATCCCGGTGGTGGGATCCACG 

MMMMMMMMMMMMMMMMMMIMMMMMMMMMMMI 

Sbjct 1501 CGGTGGCTTCTCGCTGCGCAGCCTCC- G 1 I I 7 " GGGTGGTGGGATCCACG

Query 1561 CTGGGCCGAGGATGGGACGTTTTTCTTCTTGGGCCCGGTCCACAGGTCCAAGGACACCCT
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 1561 CTGGGCCGAGGATGGGACGTTTTTCTTCTTGGGCCCGGTCCACAGGTCCAAGGACACCCT

Query 1621 CCCTCCAGGGTCCTCTCTTCCACAGTGGCGGGCCCACTCAGCCCCGAGACCACCCAACCT
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 1621 CCCTCCAGGGTCCTCTCTTCCACAGTGGCGGGCCCACTCAGCCCCGAGACCACCCAACCT



CTGATGATGGGATGCTCTTTAAATAATAAAGATGGTTTTGATT 



>gb|BC025716 

(cDNA clone I 
Length=1761 



CACCCTCCTGACCCCCATGTAAATATTGTTCTGCTGTCTGGGACTCCTGTCTAGGTGCCC 



CACCCTCCTGACCCCCATGTAAATATTGTTCTGCTGTCTGGGACTCCTGTCTAGGTGCCC 
CTGATGATGGGATGCTCTTTAAATAATAAAGATGGTTTTGA': 



. (transmembrane protease, 



:mbrane protease, serine 1) 
Ls 

1) [Homo sapier 



Strand=Plus/Plus 



GCCTGGCCTAGCAGGCCCCACGCCACCGCCTCTGCCTCCAGGCCGCCCGCTGCTGCGGGG 108

CCACCATGCTCCTGCCCAGGCCTGGAGACTGACCCGACCCCGGCACTACCTCGAGGCTCC 168

M M M M M M M M M M M M M M M M M M MMM M M M M M M M M M 

" r ' -"""CTGCCCAGGCCTGGAGACTGACCCGACCCCGGCACTACCTCGAGGCTCC 12 0 



CCACCATGCTCCTGC 



Query 22 9 
Sbjct 181 



GCCCCCACCTGCTGGACCCCAGGGTCCCACC^T ; ' :AGGAG C " 3CCAGGGAATCAT 22! 

MMMMMMMMMMMMMMMIMMMMMMMMMMMMMMI 

GCCCCCACCTGCTGGACCCCAGGGTCCCACCCTGGCCCAGGAGGTCAGCCAGGGAATCAT 1 8 ( 
TAACAAGAGGCAGTGACATGGCGCAGAAGGAGGGTGGCCGGACTGTGCCATGCTGCTCCA 2 8 ! 
TAACAAGAGGCAGTGACATGGCGCAGAAGGA 24< 
GACCCAAGGTGGCAGCTCTCACTGCGGGGACCCTGCTACTTCTGACAGCCATCGGGGCGG 
GACCCAAGGTGGCAGCTCTCACT 
CATCCTGGGCCATTGTGGCTGTTCTCCTCAGGAGTGACCAGGAGCCGCTGTACCCAGTGC 

NMMIIIIIIIIIMI^ 

AGGTCAGCTCTGCGGACGCTCGGCTCATGGTCTTTGACAAGACGGAAGGGACGTGGCGGC 

MMMMMMMMMMMMMMMMIMMMMMMMMMMMMMI 

AGGTCAGCTCTGCGGACGCTCGGCTCATGGTCTTTGACAAGACGGAAGGGACGTGGCGGC 



TCCTCAGGGCACTGACCCACTCCGAGCTGGACGTGCGAACGGCGGGCGCCAATGGCACGT 58 8 

wmLWLm^^ 540 



CGGGCTTCTTCTGTGTGGACGAGGGGAGGCT 
sbjct 72:

Query 82! 
Sbjct 

Query 88! 

Sbjct 84 

Query 94 

Sbjct 90 

Sbjct 96 

Sbjct 10. 

Sbjct 10 

Sbjct 1141 

Query 124 9 

Sbjct 1201 

Query 1309 



:CGTTTCTTGGCCGCCATCTGCCAAGACTGTGGCC 



GCAGGAAGCTGCCCGTGGi 



GGGACTGGGTGCTGACAGCCGCCCACTGCTTCCCGGAGCGGAACCGGGTCCTGTCCCGAT 88 8 




'GCATCGTGGGAGGCCGGKjAC 



9 GCAACGATATTGCCCTGGTCCACCTCTCCAGT ' 'GGTGO: "TCACAGAATACATCCAGC 

II MM MM MM III MM i 

GCAACGATATTGCCCTGGTCCACCTCTCCAGTCCCCTGCCCCTCACAGAATACATCCAGC 
9 CTGTGTGCCTCCCAGCTGCCGGCCAGGCCCTGGTGGATGGCAAGATCTGTACCGTGACGG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 CTGTGTGCCTCCCAGCTGCCGGCCAGGCCCTGGTGGATGGCAAGATCTGTACCGTGACGG 
9 GCTGGGGCAACACGCAGTACTATGGCCAACAGGCCGGGGTACTCCAGGAGGCTCGAGTCC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 GCTGGGGCAACACGCAGTACTATGGCCAACAGGCCGGGGTACTCCAGGAGGCTCGAGTCC 
9 CCATAATCAGCAATGATGTCTGCAATGGCGCTGACTTCTATGGAAACCAGATCAAGCCCA 



CciliilciGciilGilGlclGC^GCG^iii 

tltlMIMMMIMMIMM^ 



MMMMMMMMMI 



TGAGTTGGGG-- II, TGT T AA AGTCAGTG 



Query 142 9 ACTTCCGGGAGTGGATCTTCCAGGCCATAAAGACTCJ 



MMMMMMMMMI 

ACTTCCGGGAGTGGATCTTCCAGGCCATAAAGACTCA' 



Query 14 8 9 
Sbjct 1441 



GGGAGGTGTGAGGGGTGGGTTGTGGGTGGGGAGGGTGA 



CCCAGCTCTGACCGGTGGC 
GTGGGATCCACGCTGGGCCGAGGATGGGACGTT' 

MMMMMMMMMI 




ICTTCTCGCTG ' - 1 1 , i i "CGAGGTGATCCCGGTG 



vGCGGCATGGTGA 



'1 rCT fGGGCCCGGTCCACAGGTC 



Query 1669 

Sbjct 1621 

Query 172 9 

Sbjct 1681 



ACCACCCAACclcACCclcc!^ 



GTCTAGGTGCCCCTGATGATGGGATGCTCTTTAAATAATAAAGATGGTTTTGATT 1783

mm™iMm™j«y[ii i7 35 
>ref|XM_001093578.1|

1) , transcript variant 2 i 
Length=1785 



HPN) , mRNA 



.'transmembrane protease. 



protease, serine 



GAGCCCGCTTTCGAGGGACGCTACCTGAGGGCCCACAGGTGAGGCAGCCTGGCCTAGCAG 

I I I I I I I I I I I I I I I I I I I I I I I I I MUM MM III M 

GAGCCCGCTTTCCAGGGACCCTACCCGAGGGCCCACAGGTGAGGCAGCCTGGCCTAGCAG 

GCCCCACGCCACCGCCTCTGCCTCCAGGCCGCCCGCTGCTGCGGGGCCACCATGCTCCTG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I i I M I 1 I I I I I I I I I I I I 
GCCCCACGCCACCGCTTCTGCCTCCAGGCCACCCGCTGCTGCGGGGCCACCATGCTCCTG 



MMMMMMMMIMMIMIMM II Mill I I I 1 M 1 I I I I lllllll I 
GACCCCAGGGTCCCACCCTGGCCCA33A 7 - - kATCATl -.A:AAGAGGCAGT

- iMI MM |:| i I I 

GACCCCAGGGTCCCACCCTGGCCCAGGAGGTCAGCCAGGGAATCATTAACAAGAGGCGGT 



GCTCTCACT3 !G i i , I , i i , i i ?. 6 2 

INMIIIIIMIIIIMlim^ 3S2 

GTGGCTGTTCTCCTCAGGAGTGACCAGGAGCCGCTGTACCCAGTGCAGGTCAGCTCTGCG 422
||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||||
GTGGCTGTTCTCCTCAGGAGTGACCAGGAGCCGCTCTACCCAGTGCAGGTCAGCTCTGCG 422



Sbjct 
Sbjct 
Sbjct 
Sbjct 
Sbjct 
Sbjct 



IMMMIIIIMIIIIMlim 

GTGGACGAGGGGAGGCTGCCCCACACCCAGAGGCTGCTGGAGGTCATCTCCGTGTGTGAT 

MMI MMMMMMM M M M M M M M M 1 1 M M M M M M M M M M 

GTGGATGAGGGGAGGCTGCCACACACCCAGAGGCTGCTGGAGGTCATCTCCGTGTGTGAC 
TGCCCCAGAGGCCGTTTCTTGGCCGCCATCTGCCAAGACTGTGGCCGCAGGAAGCTGCCC 

1 1 MMMMMMMMMMI 1 1 M M M M 1 1 1 M M M M M M M M M M I 

TGTCCCAGAGGCCGTTTCTTGGCCACCGTCTGCCAAGACTGTGGCCGCAGGAAGCTGCCC 



c!!ogc!a!ga!g^ 

acagccgcccactgcttcccggagcggaaccgggtcctgtcccgatggcgagtgtttgcc 

icAGCTGCCCACTGCTTCCre 



CACGGGGGCTATCTTCCCTTTCGGGACCCCAACAGCGAGGAGAACAGCAACGATATTGC': 

MMMMMMMMMMMMMMMMMMMMMMMMM MMMMI 

CACGGGGGCTATCTTCCCTTTCGGGACCCCAACAGCGAGGAGAACAGCAATGATATTGCC 
CAGTACTATGGCCAACAGGCCGGGGTACTCCAGGAGGCTCGAGTCCCCATAATCAGCAAT 

M M M M M M M M M M M M M M M M M MM MM M M M M M M M M M 

CAGTACTATGGCCAACAG3 , , , 1 1 , , 3 3A 3TCCCCATAATCAGCAAT 



Query 1203 GATGTCTGCAATGGCGCTGACTTCTATGGAAACCAGATCAAGCCCAAGATGTTCTGTGCT
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 1203 GATGTCTGCAATGGCGCTGACTTCTATGGAAACCAGATCAAGCCCAAGATGTTCTGTGCT



l2 63 GGCTACCCCGAGGGTGGCATTGATGCCTGCCAGGGCGACAGCGGTC 



1323 
1323 



Sbjct 



GAGGACAGCATCTCTCGGACGCCACGTT III -IT T , - 3TTGGGGCACT 



LGGCGTCTACACCAAAGTCAGTGACT 
ATCTTCCAGGCCATAAAGACTCACTCCGAAGCCAGCGGCATGGTGACCCAGCTCTGACCG 

MMMMMMMMMMMMMMMMIMMMMMMMMMMMMMI 

ATCTTCCAGGCCATAAAGACTCACT 33 3 1 33TCTGACCG 



GGGCCGAGGATGGGACGT1 



. - 'TCTTGGGCCCGGTCCACAGGTCCAAGGACACCCTCC 

Query 1623 CTCCAGGGTCCTCTCTTCCACAGTGGCGGGCCCACTCAGCCCCGAGACCACCCAACCTCA
CCCTCCTGACCCCCATGTAAATATTGTTCTGCTGTCTGGGACTCCTGTCTAGGTGCCCCT 1742

Ill MM MM III I IMMMMMM 

r " imm ~TTCTGCTGTCTGGGACTCCTCTCTAGGTGCCCCT 1742 



GATGATGGGATGCTCTTTAAATAATAAAGATGGT1 



i ! : M TT T TTlT TTfTTlT TTT TfTTTTTtTT 

GATGACGGGATGCTCTTTAAATAATAAAGATGGTTTT3-.TT 



igo abelii mRNA; cDNA DKFZp469J 



Score = 3000 bits (1624 
Identities = 1676/1702 
Strand=Plus/Plus 



Sbjct 198 

Query 322 

Sbjct 258 

Query 3 82 

Sbjct 318 

Sbjct 378 



(from clone DKFZp469A1831)




GGCCCAGGAGGTCAGCCAGGGAATCATTAACA.- 3A - 7 , 3 -3CAGAAGGAGG 261 

MMMMMMMMMMMMMMMMMMM MMMMMMMMMMI 

Sbjct 138 GGCCCAGGAGGTCAGCCAGGGAATCATTAACAAGAGGCGGTGACATGGCGCAGAAGGAGG 197

Query 262 GTGGCCGGACTGTGCCATGCTGCTCCAGACCCAAGGTGGCAGCTCTCACTGCGGGGACCC 321 



MMMMMMMMMMIMMMMMMMMMMMMMI 

GTGGCCGGACTGTGCCATGCTGCTCCAGACCCAAGGTGGCAGCTCTCACTGCGGGGACCC 2 5 7 

^CAGCCATCGGGGCGGCATCCTGGGCCATTGTGGCTGTTCTCCTCAGGA 317 

GTGACCAGGAGCCGCTGTACCCAGTGCAGGTCAGCTCTGCGGACGCTCGGCTCATGGTCT 441 

M M M M M M M M M M M M M M M M M M MMM M M M M M M M M M 

GTGACCAGGAGCCGCTGTACCCAGTGCAGGTCAGCTCTGCGGACGCTCGGCTCATGGTCT 3 7 7 

TTGACAAGACGGAAGGGACGTGGCGGCTGCTGTGCTCCTCGCGCTCCAACGCCAGGGTAG 501 

M M M M M M M M M M M M M M M M I 1 1 1 I I I I M M M M M M M M M M 

ttgA':aagA':ggaagggA':gtgG':gG':tG':tgtG':T':':T':G':G':T':':aA':G':>:agggtag 437 

CCGGACTCAGCTGCGAGGAGATGGGCTTCCTCAGGGCACTGACCCACTCCGAGCTGGACG 561 

CCGGACTCAGCTGCGTGGAGATGGGCT 4 9 7 



Query 682 

Sbjct 618 

Query 742 

Sbjct 678 

Query 802 

Sbjct 738 

Query 862 
Sbjct 

Sbjct 



3CCGTTTCT 617 

TGGCCGCCATCTGCCAAGACTGTGGCCGCAGGAAGCTGCCCGTGGACCGCATCGTGGGAG 741 

MMI MMMMMMMMMMMMIMMMMMMMMMMMMMMI 

- - - - CCAAGACTGTGGCCGCAGGAAGCTGCCCGTGGACCGCATCGTGGGAG 67 7 



I!ACCAGCTTGGGCCGGTGGCCGTGGCAA 737 



MMMMMMMMMIMMMMMMMMMMMM 



iM MM MM IN m 



CTCCCCACGGTCTGCAGCTGGGGGTGCAGGCTGTGGTCTAC'rACGGGGGCTATCl 



AGAGCGGAACCGGGT 
_TCCCCACGGTCTGCA' 

MMMMM MMI 



MMMMMMMM 



M M M M M M M M 



MMMMMMMM, 



Sbjct 



MMMMMMMM 



MMMMMMMIMMMMMMMMMMMMI I 



CCCTGCCCCTCACAGAATVC-TCC-G^ T I ^"-GGCCCTGG 



MMMMMMMMMMMMMMMMMMM MM 



CCCTGCCCCTCACAGAATACATCCAGCCTGTGTGCCTCCCAGCTGCCGGCCAGGCCCTGG 
TGGATGGCAAGATCTGTACCGTGACGGGCTGGGGCAACACGCAGTACTATGGCCAACAGG 



MMMMMMMMMIMMMMMMMMMMMM 



103 8 TGGATGGCAAGATCTGTACCGTGACGGGCTGGGGCAACACGCAGTACTATGGCCAACAGG 



MMIM MM MMI' 



IMIIMI MM 
AGCCAGGCGTCTACACCAAAGTCAGTGACTT- I -in jATAAAGA

AGCCAGGCGTCTACACCAAAGTCAGTGACTT 

CTCACTCCGAAGCCAGCGGCATGGTGACCCAGCTCTGACCGGTGGCTTCTCGCTGCGCAG 

1 1 1 1 III MM MM III III 

CTCATTCCGAAGCCAGCGGCATGGTGACCCAGCTCTGACCGGTGGCTTCTCGCTGCGCAG 



AATATTGTTCTGCTGTCTGGGACTCCTGTCTAGGTGCCCCTGATGATGGGATGCTCTTTA 

' ' i ' I I I I I ' I I I I I I I I I I I I I 

AATATTGTTCTGCTGTCTGGG 3GGATGCTCTTTA 



Sbjct 



>ref|NM_182983.1| Homo sapiens hepsin
transcript variant 1, mRNA
emb|X07732.1|HSHEPSH Human hepatoma mRNA f.
Length=2363 



protease, serine 1 
protease hepsin 
1) [Homo sapiens] 






Strand=Plus/Plus 



Query 431 

Sbjct 1011 

Query 4 91 

Sbjct 1071 

Sbjct 

Query 61 

Sbjct 11 

Query 671 



131 



GCAGAAGGAGGGTGGCCGGACTGTGCCATGCTGCTCC, 



iCTGTGCCATGCTGCTCCAGACCCAAGGTGGCAGCTCTCAC 
TGCGGGGACCCTGCTACTTCTGACAGCCATCGGGGCGGCATCCTGGGCCATTGTGGCTGT 



I I I I I I I I I I I I I I I I I I I I I I I I I 
TGCGGGGACCCTGCTACTTCTGACAl 



'GACAGCCATCGGGGCGGCATCCTGGGCCATTC 
TCTCCTCAGGAGTGACCAGGAGCCGCTGTACCCAGTGCAGGTCAGCTCTGCGGACGCTCG 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



TCTCCTCAGGAGTGACCAGGAGCCG ,T - L 1,3,3a , T T3CGGACGCTCG 1010 



GCTCATGGTCTTTGACA 
GCTCATGGTCTTTGAc! 



CGCCAGGGTAGCCGGACTCAGCTGCGAGGAGATGGGC 
llllllllllllllllllllllllllll 



iCAAGACGGAAGGGACGTGGCGGCTGCTGTGCT 



CGAGCTGGACGTGCGAACGGCGGGCGCCi 

3GGGAGGCTGCCCCACACCCAGAGGCTG' 
llllllllllllllllllllllllllll 



1251 



llllllllllllllllllllllllllll 



._-.CAAGAGGCAGTGACATGGC 

■UliiMioiGGciGlGicilGGC 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

.:gggG':gG':aT':':tggG':>:attgtgG':tgt 



I I I I I I I I I I I II MM 1 1 1 1 1 1 1 1 1 1 1 1 



iT(MCACGTC(MGCTTCTTCTGTGTGGA 
GGAGGTCATCTCCGTGTGTGATTGCCCCAG 

II MM MM III il 



GGGGAGGCTGCCCCACACCCAGAGGCTGCTGGAGGTCATCTCCGTGTGTGATTGCCCCAG 6 7 ( 



191 GGGGAGGCTGCCCCACACCCAGA 3G 3T 



II MM MM III MM II 



:CGCCATCTGC:.-_- 3 1 l -A33AAGCTGCCCGTGGACCG 1310 



IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIII 
ccactgcttcccggagcggaaccgggtcctgtcccgatggcgagtgtttgccggtgccgt
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
ccactgcttcccggagcggaaccgggtcctgtcccgatggcgagtgtttgccggtgccgt



1551 
1031 



CTATCTTCCCTTTCGGGACCCCAACA3" , - - TTGCCCTGGTCCA 

MMIIIIIIIIMI^ 

CCTCTCCAGTCCCCTGCCCCTCACAGAATACATCCAGCCTGTGTGCCTCCCAGCTGCCGG 

Jl MM MM Ml M. 

CCTCTCCAGTCCCCTGCCCCTCACAGAATACATCCAGCCT IT IT M ITCCCAGCTGCCGG 



Sbjct 



2271 
1751 
2331 



J I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Query 1331 CAT ~*T i T TGTGT TOT ,TTGGGGCACTGGCTGTGC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I [ M 1 1 1 1 1 1 1 1 ! 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Sb]Ct 1911 CATCTCTCGGACGCCACGTTGGCGGCTGT 3TTGGGGCACTGGCTGTGC 



M M M M M M M M M M M I 



1610 
1090 



1850 
1330 



CCTGGCCCAGAAGCCAGGCGTCTACACCAAAGTCAGTGACTTCCGGGAGTGGATCTTCCA 
GGCCATAAAGACTCACTCCGAAGCCAGCGGCATGGTGACCCAGCTCTGACCGGTGGCTTC 

,,3i ^1111^!^ 

1511 TCGCTGCGCAGCCTCCAGGGCCCGAGGTGATCCCGGTGGTGGGATCCACGCTGGGCCGAG 

2091 1^^!^ 

1571 GATGGGACGTTTTTCTTCTTGGGCCCGGTCCACAGGTCCAAGGACACCCTCCCTCCAGGG 

M M M M M M M M M M M M M M M M M M M M M M M M M M M M M M 

2151 GATGGGACGTTTTTCTTCTTGGGCCCGGTCCACAGGTCCAAGGACACCCTCCCTCCAGGG 
1631 TCCTCTCTTCCACAGTGGCGGGCCCACTCAGCCCCGAGACCACCCAACCTCACCCTCCTG 

M M M M M M M M M M M M M M M M M I II M M M M M M M M M M M I 

2211 TCCTCTCTTCCACAGTGGCGGGCCCACTCAGCCCCGAGACCACCCAACCTCACCCTCCTG 
GTTCTGCTGTCTGGGACTCCTGTCTAGGTGCCCCTGATGATGG 



1450 
2030 

2090 
1570 
2150 
1630 
2210 
1690 
2270 
1750 
2330 



GATGCTCTTTAAATAATAAAGATGGTTTTGATT 1783
^MmiiMiimiJlii 2363 



Strand=Plus/Plus 



TCGAGCCCGCTTTCCAGGGACCCTA^ -T i I 3 r "TGGCCTAGC 

TCGAGCCCGCTTTCCAGGGACCCTACCTGAGGGCC 



TGCCCAGGCCTGGAGACT3- T 3 -CCCCACCTGC 

MM MM MM III M : 

TGCCCAGGCCTGGAGACTGACCCGACCCCGGCACTACCTCGAGGCTCCGCCCCCACCTGC 



Query 181 TGGACCCCAGGGT 



Iggaccccaggg! 19 ; 



>ref|XM_001093699.1| PREDICTED: Macaca mulatta hepsin (transmembrane protease,
1), transcript variant 3 (HPN), mRNA
Length=2363
Score = 2754 1



IMIII MM III ' MMMMM 



Sbjct 
Sbjct 
Sbjct 
Sbjct 
Sbjct 
Sbjct 



GCAGAAGGAGGGTGGCCGGACTGTGCCATGCTGCTCCAGACCCAAGGTGGCAGCTCTCAC
GCAGAAGGAGGGTGGCCGGACTCT 

TGCGGGGACCCTGCTACTTCTGACAGCCATCGGGGCGGCATCCTGGGCCATTGTGGCTGT 

Ml MMMMMMMMMMMMMMMIMIMMMMMMMMMMM 

TGCAGGGACCCTGCTACTTCTGACAGCCATCGGGGCGGCATCCTGGGCCATTGTGGCTGT 
TCTCCTCAGGAGTGACCAGGAGCCGCTGTACCCAGTGCAGGTCAGCTCTGCGGACGCTCG 

M M M M M M M M M M M M M I M M M M M M M M M M M M M M M M 

TCTCCTCAGGAGTGACCAGGAGCCGCTCTACCCAGTGCAGGTCAGCTCTGCGGACGCTCG 
GCTCATGGTCTTTGACAAGACGGAAGGGACGTGGCGGCTGCTGTGCTCCTCGCGCTCCAA 

MMMMMMMMMMMMMMMMMMIMMI MMMM MMMM 

GCTCATGGTCTTTGACAAGACGGAAGGGACGTGGCGGCTGCTATGCTCCTCACGCTCCAA 
CGAGCTGGACGTGCGAACGGCGGGCGCCAATGGCACGTCGGGCTTCTTCTGTGTGGACGA 

MM M M M M M M M M M M M M I MMMM MMMMMMMMI M 

CGAGTTGGACGTGCGAACGGCGGGCGCCAACGGCACGTCAGGCTTCTTCTGTGTGGATGA 
GGGGAGGCTGCCCCACACCCAGAGGCTGCTGGAGGTCATCTCCGTGTGTGATTGCCCCAG 

MMMMMM MMMMMMMMMMMMIMMMMMMI M MMI 

GGGGAGGCTGCCACACACCCAGAGGCTGCTGGAGGTCATCTCCGTGTGTGACTGTCCCAG 



CATCGTGGGAGGCCGGGACACCAG^ 
TGATGGAGCACACCTCTGTGGGGGATCCCTGCTCTCCGGGGACTGGGTGCTGACAGCCGC 



Query 851  CCACTGCTTCCCGGAGCGGAACCGGGTCCTGTCCCGATGGCGAGTGTTTGCCGGTGCCGT
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 1429 CCACTGCTTCCCGGAGCGGAACCGGGTCCTGTCCCGATGGCGAGTGTTTGCCGGTGCCGT

Query 911  GGCCCAGGCCTCTCCCCACGGTCTGCAGCTGGGGGTGCAGGCTGTGGTCTACCACGGGGG
           ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||||||
Sbjct 1489 GGCCCAGGCCTCTCCCCACGGCCTGCAGCTGGGGGTGCAGGCTGTGGTCTACCACGGGGG
971 CTATCTTCCCTTTCGGGACCCCAACAGCGAGGAGAACAGCAACGATATTGCCCTGGTCCA 

is49 im^mum^ 

1031 CCTCTCCAGTCCCCTGCCCCTCACAGAATACATCCAGCCTGTGTGCCTCCCAGCTGCCGG 
1609 

Query 1091 CCAGGCCCTGGTGGATGGCAAGATCTGTACCGTGACGGGCTGGGGCAACACGCAGTACTA
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 1669 CCAGGCCCTGGTGGATGGCAAGATCTGTACCGTGACGGGCTGGGGCAACACGCAGTACTA

Query 1151 TGGCCAACAGGCCGGGGTACTCCAGGAGGCTCGAGTCCCCATAATCAGCAATGATGTCTG
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 1729 TGGCCAACAGGCCGGGGTACTCCAGGAGGCTCGAGTCCCCATAATCAGCAATGATGTCTG

Query 1211 CAATGGCGCTGACTTCTATGGAAACCAGATCAAGCCCAAGATGTTCTGTGCTGGCTACCC

Sbjct 1789 CAATGGCGCTGACTTCTM

Query 1271 CGAGGGTGGCATTGATGCCTGCCAGGGCGACAGCGGTGGTCCCTTTGTGTGTGAGGACAG

Sbjct 1849 CGAGGGTGGCATTGATGCCTG



Sbjct 

Query 1451 

Sbjct 2029 

Query 1511 



CCTGGCCCAGAAGCCAGGCGTCTACACCAAAGTCAGTGA n , - 3TGGATCTTCCA 

MMMMMMMMMMMMMMMIMMMMMMMMMMMMMMI 

CCTGGCCCAGAAGCCAGGCGTCTACACCAAAGTCAGTGACTTCCGGGAGTGGATCTTCCA 



TCGCTGCGCAGCCTCCAGGG ' iG iGGATCCACGCTGGGCCGAG 



Query 1571 GATGGGACGTT' 



rCTTCTTGGGCCCGGTCCACAGGTCCAAGGACACCCTCCCTCCAGGG 
TCCTCTCTTCCACAGTGGCGGGCCCACTCAGCCCCGAGACCACCCAACCTCACCCTCCTG

IIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIII I I I I I I I I I I I I I I 

•" ""CAGCCCCGAGACCACCCGACCTCACCCTCCTG 



Sbjct 



GATGCT CTTT AAAT AA1 - 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GATGCT CTTT AAAT AAT, : 



Strand=Plus/Plus 



GAGCCCGCTTTCCAGGGACCCTACCTGAGGGCCCACAGGTGAGGCAGCCTGGCCTAGCAG 62 

GAGCCCGCTTTCCAGGGACCCTA 62 

GCCCCACGCCACCGCCTCTGCCTCCAGGCCGCCCGCTGCTGCGGGGCCACCATGCTCCTG 122 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I ' 

GCCCCACGCCACCGCTTCTGCCTCCAGGCCACCCGOT 3' 'T - rGGGGCCACCATGCTCCTG 122 



123 CCCAGGCCTGGAGACTGACCCGACCCCGGAACCACCTCCAGGCTCCGCCCTCACCTGCCG



J Homo sapiens cDNA, FLJ9674 6 



Score = 2645 bits (1432), Expect = 0.0
Identities = 1432/1432 (100%), Gaps = 0/1432 (0%)
Strand=Plus/Plus 

Query 68  ACGCCACCGCCTCTGCCTCCAGGCCGCCCGCTGCTGCGGGGCCACCATGCTCCTGCCCAG 127
Sbjct 1   ACGCCACCGCCTCTGCCTCCAGGCCGCCCGCTGCTGCGGGGCCACCATGCTCCTGCCCAG 60

Query 128 GCCTGGAGACTGACCCGACCCCGGCACTACCTCGAGGCTCCGCCCCCACCTGCTGGACCC 187
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 61  GCCTGGAGACTGACCCGACCCCGGCACTACCTCGAGGCTCCGCCCCCACCTGCTGGACCC 120

Query 188 CAGGGTCCCACCCTGGCCCAGGAGGTCAGCCAGGGAATCATTAACAAGAGGCAGTGACAT 247

CAGGGTCCCACCCTGGCCCAGGAGGTCAGCCAGGGAATCATTAACAAGAGGCAGTGACAT 180
Query 248 GGCGCAGAAGGAGGGTGGCCGGACTGTGCCATGCTGCTCCAGACCCAAGGTGGCAGCTCT 307 



Sbjct 



Query 788 
Sbjct 72: 



1 1 1 1 1 

CAGGG 

GGCGC. 
Mill 
GGCGC 

CACTG 
I I I I I 



GGCGCAGAAGGAGGGTGGCCGGACTGTGCCATGCTGCTCCAGACCCAAGGTGGCAGCTCT 240
CACTGCGGGGACCCTGCTACTTCTGACAGCCATCGGGGCGGCATCCTGGGCCATTGTGGC 367



CTCCGAGCTGGA^^ 



TCGGCTCATGGTCTTTGACAAGACGGAAGGGACGTGGCGGCTGCTGTGCTCCTCGCGCTC 4 2 ■ 
CAACGCCAGGGTAGCCGGACTCAGCTGCGAGGAGATGGGCTTCCTCAGGGCACTGACCCA 54 ' 



CAACGCCAGGGTAGCCGGACTC-G^T r 3 33CACTGACCCA 480 



Ml 



I I 



'CTTCTGTGTGGA 

llll 



CAGAGGCCGTTTCTTGGCCGCCATCTGCCAAGACTGTGGCCGCAGGAAGCTGCCCGTGGA 660

CCGCATCGTGGGAGGCCGGGACACCAGCTTGGGCCGGTGGCCGTGGCAAGTCAGCCTTCG 787
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
CCGCATCGTGGGAGGCCGGGACACCAGCTTGGGCCGGTGGCCGTGGCAAGTCAGCCTTCG 720

CTATGATGGAGCACACCTCTGTGGGGGATCCCTGCTCTCCGGGGACTGGGTGCTGACAGC 847

CTATGATGGAG^^ 780 

CGCCCACTGCTTCCCGGAGCGGAACCGGGTCCTGTCCCGATGGCGAGTGTTTGCCGGTGC 907
Sbjct
Sbjct 
Sbjct 



CGTGGCCCAGGCCTCTCCCCACGGTCTGCAGCTGGGGGTGCAGGCTGTGGTCTACCACGG 



I I I I I I I I I I I I I I I I I I I I 



iGCTGGGGG 1 ] 



lllllllllllll 

1TGCAGGCTGTGGTCTACCACGG 



n MM MM III M 



sbjct 90: 

Query 1028 CCACCTCTCCAGTCCCCTGCCCCTCACAGAATACATCCAGCCTGTGTGCCTCCCAGCTGC
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 961  CCACCTCTCCAGTCCCCTGCCCCTCACAGAATACATCCAGCCTGTGTGCCTCCCAGCTGC

Query 1088 CGGCCAGGCCCTGGTGGATGGCAAGATCTGTACCGTGACGGGCTGGGGCAACACGCAGTA

Sbjct 1021 CGGCCAGGCCCTGGTGGATGGCAAGATCTGTA

Query 1148 CTATGGCCAACAGGCCGGGGTACTCCAGGAGGCTCGAGTCCCCATAATCAGCAATGATGT
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 1081 CTATGGCCAACAGGCCGGGGTACTCCAGGAGGCTCGAGTCCCCATAATCAGCAATGATGT

Query 1208 CTGCAATGGCGCTGACTTCTATGGAAACCAGATCAAGCCCAAGATGTTCTGTGCTGGCTA
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 1141 CTGCAATGGCGCTGACTTCTATGGAAACCAGATCAAGCCCAAGATGTTCTGTGCTGGCTA

Query 1268 CCCCGAGGGTGGCATTGATGCCTGCCAGGGCGACAGCGGTGGTCCCTTTGTGTGTGAGGA

Sbjct 1201 CCCCGAGGGTGGCATTGATGCCTGCCAGGGCGACAGCGGTGGTCCCTTTGTGTGTGAGGA

Query 1328 CAGCATCTCTCGGACGCCACGTTGGCGGCTGTGTGGCATTGTGAGTTGGGGCACTGGCTG
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 1261 CAGCATCTCTCGGACGCCACGTTGGCGGCTGTGTGGCATTGTGAGTTGGGGCACTGGCTG

Query 1388 TGCCCTGGCCCAGAAGCCAGGCGTCTACACCAAAGTCAGTGACTTCCGGGAGTGGATCTT
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 1321 TGCCCTGGCCCAGAAGCCAGGCGTCTACACCAAAGTCAGTGACTTCCGGGAGTGGATCTT

Query 1448 CCAGGCCATAAAGACTCACTCCGAAGCCAGCGGCATGGTGACCCAGCTCTGA 1499
           ||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 1381 CCAGGCCATAAAGACTCACTCCGAAGCCAGCGGCATGGTGACCCAGCTCTGA 1432



>gb|DQ895314.2| Synthetic construct Homo sapiens clone IMAGE:100009774;
RZPDO839H04139D hepsin (transmembrane protease, serine
1) (HPN) gene, encodes complete protein
Length=1294



Score = 2309 bits (1250), Expect =0.0 
Identities = 1252/1253 (99%), Gaps = 0/1253 
Strand=Plus/Plus 

Query 245 CATGGCGCAGAAGGAGGGTGGCCGGACTGTGCCATGCTGCTCCAGACCCAAGGTGGCAGC 304 



Sbjct 142 

Sbjct 202 

Query 485 

Sbjct 262 

Query 54 5 

Sbjct 322 

Sbjct 382 

Sbjct 442 



MMMMMMMMMMMMMMMIMM 



CATGGCGCAGAAGGAGGGTGGCCGGACTGTG I! TGCTGCT 3 "CAAGGTGGCAGC 8: 



TCTCACTGCGGGGACCCTGCTACTTCTGACAGCCA 

nnninnisniniininnnnni 

?T?T????T?TTT?T?TTT?T?TT?T?T?TTT??T 

CGCTCGGCTCATGGTCTTTGACAAGACGGAAGGGACGTGGCGGCTGCTGTGCTCCTCGCG 261

CTCCAACGCCAGGGTAGCCGGACTCAGCTGCGAGGAGATGGGCTTCCTCAGGGCACTGAC 544
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
CTCCAACGCCAGGGTAGCCGGACTCAGCTGCGAGGAGATGGGCTTCCTCAGGGCACTGAC 321



GGACGAGGGGAGGCTGCCCCACACCCAGAGGCTGC 



CCCCAGAGGCCGTTTCTTGGCCGr k3 21 3CCCGT 



I MM UJ I I I I I I I I I I I I I I I I 



c«Ugcggcatcctgggccattgt 

nnnninnnnnnni 

gtggcggctgctgtgctcctcgcg 
gatgggcttcctcagggcactgac 

llllllllllllllllllllllll 



iCTTCTTCTGTGT 

fflfflflfflnnniiisi 

CTGTGGCCGCAGGAAGCTGCCCGT 



Query 785 ^ - - - 

Sbjct 562 TCGCTATGATGGAGCACACCTCTGTGGG^ 

Query 84 5 AGCCGCCCACTGCTTCCCGGAGCGGAACCGGGTCCTGTCCCGATGGCGAGTGTTTGCCGG 



)://blast.ncbi.nlm.nih.( 



5/20/2008 



NCBI Blast:M18930:Human hepsin mRNA, complete cds 



Page 14 of 55 



Sbjct 
Sbjct 
Sbjct 



M MM MM MM 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Sbjct 74 

Query 1025 GGTCCACCTCTCCAGTCCCCTGCCCCTCACAGAATACATCCAGCCTGTGTGCCTCCCAGC 1084
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 802  GGTCCACCTCTCCAGTCCCCTGCCCCTCACAGAATACATCCAGCCTGTGTGCCTCCCAGC 861

Query 1085 TGCCGGCCAGGCCCTGGTGGATGGCAAGATCTGTACCGTGACGGGCTGGGGCAACACGCA 1144 

:cggccaggccctggtggatggcaaga 921 



Sbjct 



gtactatggccaacaggccggggtactccaggaggctcgagtccccataatcagcaatga 
:tgcaatggcgctgacttctatggaaaccagatcaagcccaagatgttctgtgctgg 
. 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ I ! I i 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



CTACCCCGAGGGTGGCATTGATGCCT 3 I T TTTGTGTGTGA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CTACCCCGAGGGTGGCATTGATGCCTGCCAGGGCGACAGCGGTGGTCCCTTTGTGTGTGA 
GGACAGCATCTCTCGGACGCCACGTTGGCGGCTGTGTGGCATTGTGAGTTGGGGCACTGG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 m 1 1 1 1 1 1 1 1 m i m 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GGACAGCATCTCTCGGACGCCACGTTGGCGGCTGTGTGGCATTGTGAGTTGGGGCACTGG 
CTGTGCCCTGGCCCAGAAGCCAGGCGTCTACACCAAAGTCAGTGACTTCCGGGAGTGGAT 

::: 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii ii m 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CTGTGCCCTGGCCCAGAAGCCAGGCGTCTACACCAAAGTCAGTGACTTCCGGGAGTGGAT 
Query 1445 CTTCCAGGCCATAAAGACTCACTCCGAAGCCAGCGGCATGGTGACCCAGCTCT 1497
Sbjct 1222 CTTCCAGGCCATAAAGACTCACTCCGAAGCCAGCGGCATGGTGACCCAGCTCT 1274



>gb|DQ892119.2| Synthetic construct clone IMAGE : 1
hepsin (transmembrane protease, serine 1) (HPN)
gene, encodes complete protein
Length=1294



05.01X; RZPDO839H0414 



Score = 2309 bits (1250), Expect =0.0 
Identities = 1252/1253 (99%), Gaps = 0/1253 (0%) 
Strand=Plus/Plus 

Query 245 CATGGCGCAGAAGGAGGGTGGCCGGACTGTGCCATGCTGCTCCAGACCCAAGGTGGCAGC 304 



IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMM 



CATGGCGCAGAAGGAGGGTGGCCGGACTGTG I! TGCTGCT 3 "CAAGGTGGCAGC 8: 



Sbjct 142 

Sbjct 202 

Query 485 

Sbjct 262 

Query 54 5 

Sbjct 322 

Sbjct 382 

Sbjct 442 



TCT^CTGCGclclGACCCTGCTACTTCTGACAG^i 

nnninnisniniininnnnni 

?T?T????T?TTT?T?TTT?T?TT?T?T?TTT??T 

CGCTCGGCTCATGGTCTTTGACAAGACGGAAGGGACGTGGCGGCTGCTGTGCTCCTCGCG 261

CTCCAACGCCAGGGTAGCCGGACTCAGCTGCGAGGAGATGGGCTTCCTCAGGGCACTGAC 544
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
CTCCAACGCCAGGGTAGCCGGACTCAGCTGCGAGGAGATGGGCTTCCTCAGGGCACTGAC 321



CCACTCCGAGCTGGACGT 



^Gi^iGGGGiGGclGC^kk^UGiGGclGC 



CCCCAGAGGCCGTTTCTTGGCCGr k3 2T 3CCCGT 



IIIIIMIMIIIIIIIIIIIIII 



ccLgggcggcatcctgggccattgt 

nnnnmniinnnni 

gtggcggctgctgtgctcctcgcg 
gatgggcttcctcagggcactgac 

llllllllllllllllllllllll 



iCTTCTTCTGTGT 

fflfflflfflnnniiisi 

CTGTGGCCGCAGGAAGCTGCCCGT 



Query 785 ^ - - - 

sb,ct 562 mmm^^ 

Query 84 5 AGCCGCCCACTGCTTCCCGGAGCGGAACCGGGTCCTGTCCCGATGGCGAGTGTTTGCCGG 






Sbjct 



M MM MM MM 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Sbjct 74 

Query 1025 GGTCCACCTCTCCAGTCCCCTGCCCCTCACAGAATACATCCAGCCTGTGTGCCTCCCAGC 1084
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 802  GGTCCACCTCTCCAGTCCCCTGCCCCTCACAGAATACATCCAGCCTGTGTGCCTCCCAGC 861

Query 1085 TGCCGGCCAGGCCCTGGTGGATGGCAAGATCTGTACCGTGACGGGCTGGGGCAACACGCA 1144 

:cggccaggccctggtggatggcaaga 921 



Sbjct 



gtactatggccaacaggccggggtactccaggaggctcgagtccccataatcagcaatga 
:tgcaatggcgctgacttctatggaaaccagatcaagcccaagatgttctgtgctgg 
. 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ I ! I i 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



CTACCCCGAGGGTGGCATTGATGCCT 3 I T TTTGTGTGTGA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CTACCCCGAGGGTGGCATTGATGCCTGCCAGGGCGACAGCGGTGGTCCCTTTGTGTGTGA 
GGACAGCATCTCTCGGACGCCACGTTGGCGGCTGTGTGGCATTGTGAGTTGGGGCACTGG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 m 1 1 1 1 1 1 1 1 m i m 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GGACAGCATCTCTCGGACGCCACGTTGGCGGCTGTGTGGCATTGTGAGTTGGGGCACTGG 
CTGTGCCCTGGCCCAGAAGCCAGGCGTCTACACCAAAGTCAGTGACTTCCGGGAGTGGAT 

::: 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii ii m 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CTGTGCCCTGGCCCAGAAGCCAGGCGTCTACACCAAAGTCAGTGACTTCCGGGAGTGGAT 

Query 1445 CTTCCAGGCCATAAAGACTCACTCCGAAGCCAGCGGCATGGTGACCCAGCTCT 1497
           |||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 1222 CTTCCAGGCCATAAAGACTCACTCCGAAGCCAGCGGCATGGTGACCCAGCTCT 1274



; mRNA, complete 



Strand=Plus/Plus 



Mill 

ATGGCGCV Gt- l TILT ~G ACCCAAGGTGGCAGCT 



Query 3 06 CTCACTGCGGGGACCCTGCTACTTCTGAC.-T; 3 TTTT 3 3 33 3 33 "ATT 3T IGGGCCATTGTG 

sbjct 6i imi^^ 



Query 366 



Query 426 
181 



Sbjct 

Sbjct 241 

Query 54 6 

Sbjct 301 

Query 606 

Sbjct 361 

Query 666 

Sbjct 421 

Query 72 6 

Sbjct 481 

Query 786 

Sbjct 541 

Query 84 6 



GCTGTTCTCCTCAGGAGTGACCAGGAGCCGCTGTACCCAGTGCAGGTCAGCTCTGCGGAC 

TCCAACGCCAGGGTAGCCGGACTCAGCTGCGAGGAGATGGGCTTCCTCAGGGCACTGACC 

CACTCCGAGCTGGACGTGCGAACGGCGGGCGCCAATGGCACGTCGGGCTTCTTCTGTGTG 
CACTCCGAGCTGGACGTGCGAACG 



GACGAGGGGAGGCTGCCCCACACCC-3- T T [ 3TGTGTGATTGC 

: I < I I I I I I Mill! I 

GACGAGGGGAGGCTGCCCCACACCCAGAGGCTGCTGGAGGTCATCTCCGTGTGTGATTGC 
CCCAGAGGCCGTTTCTTGGCCGCCATCTGCCAAGACTGTGGCCGCAGGAAGCTGCCCGTG 

i MM MM MM MM 

CCCAGAGGCCGTTTCTTGGCCGCCATCTGCCAAGACTGTGGCCGCAGGAAGCTGCCCGTG 



ACCGCATCGTGGGAGGCC^ 



CGCTATGATGGAGCACACCTTTTTT'AG 3 1 ■- r> ' 3 3T3 3TT I : ~ -. 3 3 3T 3TGGGTGCTGACA 
OGcIaIgaIggI^ 
Query 1086
Sbjct 841 

Sbjct 901 

Sbjct 961 
Query 1266 
Sbjct 

Sbjct 

Sbjct 



IMMMIIIIIIIIIIIIIIIIIM 



GTCCACCTCTCCAGTCCCCTGCCCCTCACAGAATACATCCAGCCTGTGTGCCTCCCAGCT 84 0 

Query 1086 GCCGGCCAGGCCCTGGTGGATGGCAAGATCTGTACCGTGACGGGCTGGGGCAACACGCAG 1145
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 841  GCCGGCCAGGCCCTGGTGGATGGCAAGATCTGTACCGTGACGGGCTGGGGCAACACGCAG 900

Query 1146 TACTATGGCCAACAGGCCGGGGTACTCCAGGAGGCTCGAGTCCCCATAATCAGCAATGAT 1205
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 901  TACTATGGCCAACAGGCCGGGGTACTCCAGGAGGCTCGAGTCCCCATAATCAGCAATGAT 960

Query 1206 GTCTGCAATGGCGCTGACTTCTATGGAAACCAGATCAAGCCCAAGATGTTCTGTGCTGGC 1265
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 961  GTCTGCAATGGCGCTGACTTCTATGGAAACCAGATCAAGCCCAAGATGTTCTGTGCTGGC 1020

Query 1266 TACCCCGAGGGTGGCATTGATGCCTGCCAGGGCGACAGCGGTGGTCCCTTTGTGTGTGAG 1325
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 1021 TACCCCGAGGGTGGCATTGATGCCTGCCAGGGCGACAGCGGTGGTCCCTTTGTGTGTGAG 1080

Query 1326 GACAGCATCTCTCGGACGCCACGTTGGCGGCTGTGTGGCATTGTGAGTTGGGGCACTGGC 1385
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 1081 GACAGCATCTCTCGGACGCCACGTTGGCGGCTGTGTGGCATTGTGAGTTGGGGCACTGGC 1140

Query 1386 TGTGCCCTGGCCCAGAAGCCAGGCGTCTACACCAAAGTCAGTGACTTCCGGGAGTGGATC 1445
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 1141 TGTGCCCTGGCCCAGAAGCCAGGCGTCTACACCAAAGTCAGTGACTTCCGGGAGTGGATC 1200

Query 1446 TTCCAG 1451
Sbjct 1201 TTCCAG 1206



>emb | X07002 . 1 | HSHEPSL H. sapiens 1 

Length=1199 



GENE ID: 3249 



(Over 10 PubMed links) 



| hepsin (transmembrane protea 



; protease hepsin 
1) [Homo sapiens] 



Strand=Plus/Plus 



ACGTCGGGCTTCTTCTGTGTGGACGAGGGGAGGCTGCCCCACACCCAGAGGCTGCTGGAG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ] I I I I I I I I I I I I I 
ACGTCGGGCTTCTTCTGTGTGGACGAGGGGAGGCTGCCCCACACCCAGAGGCTGCTGGAG 



GGCCGCAGGAAGCTGCCCGTGGACCGCATCGTGGGAGGCCGGGACACCAGCTTGGGCCGG 
GGCCGCAGGAAGCTGCCCGT 



Query 765 

Sbjct 181 

Query 82 5 

Sbjct 241 

Query 885 

Sbjct 301 

Sbjct 361 

Query 1005 

Sbjct 421 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I ! I I I I I I I 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Ml 

Query  1065  CAGCCTGTGTGCCTCCCAGCTGCCGGCCAGGCCCTGGTGGATGGCAAGATCTGTACCGTG
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  481   CAGCCTGTGTGCCTCCCAGCTGCCGGCCAGGCCCTGGTGGATGGCAAGATCTGTACCGTG

Query 1125 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



( AAGATGTT 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 m 1 1 1 1 1 1 1 1 :::: : : 

CCCAAGATGTTCTGTGCTGGCTACCCCGAGGGTGGCATTGATGCCTGCCAGGGCGACAGC 
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Sbjct 



1605 
1021 
1665 



MMMMMMMMMMMMMMMMMMMMMMMMMMMIMM 



13 65 ATTGTGAGTTGGGGCACTGGCTGTGCCCTGGCCCAGAAGCCAGGCGTCTACACCAAAGTC 14 24 



M M M M M M M M M M M M M M I 

ATTGTGAGTTGGGGCACTGGCTGTGCCCTC 



142 5 AGTGACTTCCGGGAGTGGATCTTCCAGGCCATAAAGACTCACTCCGAAGCCAGCGGCATG 14 84 



M M M M M M M M M M M M M M I 



AGTGACTTCCGGGAGTGGATCTTCCAGGCCATAAAGAl 

GTGACCCAGCTCTGACCGGTGGCTTCTCGCTGCGCAGCCTCCAGGGCCCGAGGTGATCCC 



GGTCCAAGGACACCCTCCCTCCAGGGTCC 



M M M M M M M M M M M M M M I 



MMMMMMMMMMMMMMI 

GTGACCCAGCTCTGACCGGTGGCT1 3T 3GCTC 3A 3 37 3AT 3CC 9 6 0 

GGTGGTGGGATCCACGCTGGGCCGAGGATGGGAI 

mmmmmmmmmmmmmmmm: 

GGTGGTGGGATCCACGCTGGGCCGAGGATGGGAI 



GGTCCAAGGACACCCTCCCTCCAG3 'ACTCAGCCC 



CGAGACCACCCAACCTCACCCTCCTGACCCCCATGTAAATATTGTTCTGCTGTCTGGGAC 



1081 CGAGACCACCCAACCTCACCCTCCTGACCCCCATGCAAATA 114 0 



I I I I I I I I I I I I I I I I I I I I I I I I 

GCCTCCAGGGCCCGAGGTGATCCC 

nsnissnnnnnici 

CACAGTGGCGGGCCCACTCAGCCC 
:CACAGTGGCGGGCCCACTCAGCCC 



16 6 4 
1080 



IMMMMMMMMMMMI 



5 Sus scrofa mRNA, clor 



.27D02, expressed i 



Strand=Plus 
Query 66 



Sbjct 



Sbjct 
Sbjct 
Sbjct 
Sbjct 
Sbjct 



CCACGCCACCGCCTCTGCCTCC - A 



GACCCCGGCA - C - TACCTCGAGGCTCCGCCCCCAC CTGC 18 



Ml 



I Ml I MM M 



MM 



1 1 



i ,,~3ACTGA-CCC 

M M M M I I I I I I I I I I 

CAGGCCTGGAGACTGACCCCTAAAACCGGCACCGTGTCTC-A-GCTCTGCCCCCACCCGC 13 
TGGACCCCAGGGTCCCACCCTGGCCCAGGAGGTCAGCCAGGGAATCATTAACAAGAGGCA 



CGGACCCCAGGGTCCCGCCCCGGCCCAGGAGGTCAGCCGGGGGATCATTAACTAGAGGCC 1 
GTGACATGGCGCAGAAGGAGGGTGGCCGGACTGTGCCATGCTGCTCCAGACCCAAGGTGG 



MMMMMMMI M 

CGGACCCCAGGGTCCCGCC 
GTGACATGGCGCAGAAGGA . . 

MMMMMI MM MM 

GTGACATGGCGGAGAAAGAG" 
CAGCTCTCACTGCGGGGACC 

MMMMMM MMMI 



GTGACATGGCGGAGAAAGAGGGTGGCCAGCCTGTGTCATGCTGCTCCGGACCCAAGGTGG 2 5 6 
CAGCTCTCACTGCGGGGACCCTGCTACTTCTGACAGCCATCGGGGCGGCATCCTGGGCCA 3 6 0 



TTGTGGCTGTTCTCCTCAGGAGTGACCAGGAGCCGCTGTACCCAGTGCAGGTCAGCTCTG 



M M I 



M M I M M M 



M M M M M M M M M M 



MMMMMI Illlllll 



"MM II Mil Mill Mi. 



I I I I I I I I I I I I I I I I I I 

GAGGTCAGCCGGGGGATCATTA 

GACTGTGCCATGCTGCTCCAGA' 
I I I I I I I I I I I I I I I I I I 
GCCTGTGTCATGCTGCT'rCGG- 

_TCTGACAGCCATCGGGGCGGC. 

I MMMI I I I MM M 



I I I I I I I I I I I I I I I I I I I 



llllll MM Illlllll II 



I M I I I MM Ml I I I I I 



TCGCGCTCCAACGCCAGGGTGGCAGGGCTCAGCT 3 , }A1 ' - 'TT3CTCAGGTCA 
CTGACCCACTCCGAGCTGGACGTGCGAACGGCGGGCGCCAATGGCACGTCGGGCTTCTTC 



Ml II MMMMMI Illlllll 



I I I 

______ 

AGGTGC 

Illlllll 
CAAGGTGG 

CTGGGCCA 
Illlllll 



I II I I 



CTGACCCACTCAGAGCTGGATGTG-3, T __ 3 3TCGGGCTTCTTC 555 




Sbjct 73 5 CAGTCTTCGCTACGACGGAGCACACCTCTGTGGGGGATCCCTGCTCTCCAAAGACTGGGT 7 94 

Query 839 GCTGACAGCCGCCCACTGCTTCCCGGAGCGGAACCGGGTCCTGTCCCGATGGCGAGTGTT 898 

sb.ct 795 wmmyL^^ 854 

Query 899 TGCCGGTGCCGTGGCCCAGGCCTCTCCCCACGGTCTGCAGCTGGGGGTGCAGGCTGTGGT 958 
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ctaccacgggggctatctt:- i -:~_~cagcgaggagaacagcaacgatat 
llllll I I I I llllll llllllll IIIMIIIIII M 1 Mill 
ctaccatggggactatctcccctttcgagaccccaacagtgaggagaacagcaatgatat 



TCCCAGCTGCCGGCCAGGCCCTGGTGGATGGC--3-T [ T "333CTGGGGCA 
Ill I I I I I I I I I I I II Mill MIIIIMM 



ACACGCAGTACTATGGCCAACAGGCCGGGGTACTCCAGGAGGCTCGAGTCCCCATAATCA 



GCAATGATGTCTGCAATGGCCCCGACTTCTACGGAAACCAGATCAAGCCCAAGATGTTCT 



TGTGTGAGGACAGCATCTCT T I TTGTGAGTTGGG 

MMMMMMMMMMMMMMMMIIIMMMMIIMM Mill MM 

TGTGTGAGGACAGCATCTCTCGGACGCCACGTTGGCGGCTGTGTGGCATCGTGAGCTGGG 




I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



'TGGGGTGCTCCAGGAGGCCCGAGTCCCCATAATCA 



GCACTGGCTGTGCCCTGGCCCAGAAG 2 AG i 1 -_-J^GTCAGTGACTTCCGGG 

MM I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GCACCGGCTGTGCCCTGGCCCAAAAGCCAGGCGTCTACACCAAAGTCAGTGACTTCCGGG 
AGTGGATCTTCCAGGCCATAAAGACTCACTCCGAAGCCAGCGGCATGGTGACCCAGCTCT 

MMMMMMMMIMMMMMMMMIMMMMIIMMMMIMIMM 

AGTGGATCTTCCAGGCCATAAAGACTCACTCCGAAGCCAGCGGCATGGTGACCCAGCTCT 

C - - GCTGCGCA - GCCTCCAGGGCCCGAGGTGATCCCGGTGG - TG - 

I lllllll II II llllll Mill I 

GACCTGCGGCTTCTGCTCGCTGCGCTCGCCTCCAGGGCCCAAGCTGATCCAGGTGGCTCC 

CACGCTGGGCC-GAGGATGGGACGTTTTTCTTCTTGGGCCC 
III lllllll I MM I II I I I I I I I I I I I I I I I I I 
AGCCCCTCATGATGGGGTTCACCCTGGGCCTG-GGATAGAACATTTTTCTTCTTGGGCCC 

GGTCCACAGGTCCAAGGACACCCTCCCTCCAGGGTCCTCTCTTCCACAGTGGCGGGCCCA 

[llllll IIIMIIIIII 

.'CCCTCCAAGGTCCTCTCTTCCACAGTGGCGGGCCCA 



GCATCACCCGAGCCTCACCCTCCTGACCCCCATGTAAATATTGTTCTGC 
■CTCCTGTCTAGGTGCCCCTGATGATGGGATGCTCTTTAAATAATAAAGAT 



>ref|NM_001080241.2| Bos taurus hepsin (transmembrane protease, serine 1) (HPN), mRNA
gb|BC140636.1| Bos taurus hepsin (transmembrane protease, serine 1), mRNA (cDNA clone MGC:148484 IMAGE:8196479), complete cds
Length=1919



• 1) [Bos taurus] 



Strand=Plus/Plus 



GCCTGGCCTAGCAGGCCCCACGC-CA-CCGC- -C-TCTGCCT'I rAGG^r-GCCCGCTGCT 
GCCAGGCCTAGCAGGCCCCTCGCGCTGCCGCT 



GCGGGGCCACCATGCTCCTGCCCAGGCCTGGAGACTGACCCTAAACCCG-CACCATCTCT 119 

CGAGGCTCCGCCCCCACCTGCTGGACCCCAGGGTCCCACCCTGGCCCAGGAGGTCAGCCA 219 

I II llllllllllllllll Mllllll Ml Mill lllllllllll 

C-AG-CTCCGCCCCC-3^T - - T rCGGAGGTCAGCCG 177 



GGGAAT CATT AACT AGAGGCTGTGACATGG ~ ~ ~ - ~ 

T?TT?T?? A Tt???tt?TTTT? A T?T?T?t 
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Sbjct 



Query 578 



MINIM Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMM II II MMIMIMM 



GAGATGGGCTTCCTCAGGGCACTG- ACCCi ■: I CC 3-A 3-CTG' CG rGC 3AACGGCGGGCGC 
I I I I I I I I I I I I I I I I I I I I II II Ml I I I I I I I I i I I I I I MIMIIMM 
GAGATGGGCTTCCTCAGGGCGTTGGACTT-CTCGGAGCTGGACGTGCGGACGGCGGGCGC 

CAATGGCACGTCGGGCTTCTTCTGCGTG 



.clGCCGGlcMTCGcilTGliGGCGGckiGicicciGCcI 
GGGCCGGTGGCCGTGGCAAGTCAGCCTTCGCTATGATGGAGCACACCTCTGTGGGGGATC 8 : 
MM I I I I I I I I I II I I I I I I I I I I I I I t I I I I I I I I I I I I I I MIMIIMM II 
GGGCAGGTGGCCGTGGCAAGTCAGTCTTCGCTATGATGGAGCACATCTCTGTGGGGGGTC 7 ' 

CCTGCTCTCCGGGGACTGGGTGCTGACAGCCGCCCACTGCTTCCCGGAGCGGAACCGGGT 8 



I I I I I I I 



CGTGCTCTCTAGGGACTGGGTGCTGACAGCCGCCCACTGCTTCCCTGAGCGGAACCGGGT 83 5 



MMMMIMM Mill MMMM I II Mill I 



AGCTGGGGGTGCAGGCTGTGGTCTACCACGGGGGCTATCTTCCCTTTCGGGACCCCAACA 9 9 6 



.GCTGGGGGT 
MMMM 
ACTGGGGGT 

GCGAGGAGAACAGCAACGATATTGCCCTGGTCCACCTCTCCAGTC-CCCTGCCCCTCACA 
MIIIIMM Mill II II lllll I I I I I I I I I I I I I I I I 

gcgaggagaatag :aat 3A :a r : ; : :ct 3 3tc :a - :t :t : : sg - :ac :ctgcccctcaca 
gaatacatccagcctgtgtgcctcccagctgccggccatcc 
gag: 



II I II I 

; T . 



:aatgatgtctgcaacggccccgacttctacgggaac 
cagatcaagcccaagatgttctgtgctggctaccccgagggtggcattgatgcctgccag 



cIgtgtggca 
accaaagtca 

II II II II II 

ACCAAAGTCA 

AGCGGCATGG 



lllll MM 



CCCATGTAAA 
ATGGGATGCT 



M II II II I 



_-TGC 
I III 

CGTGC 



CCCGTGTGTCTCCCGGCTGCCGkLjCA^ 



MINI MINI Mill MMM 



TGCACGGTGA 

GAGGCTCGAG 
lllll MM,,,, 
GAGGCCCGAGTCCCC 

CAGATCAAGC 

MIIIIMM 

CAGATCAAGCCCAAGATGTTCTGTGCTG , - I , ,T 33CATTGATGCC1 



ACCAAAGTCAGTGACTTCCGGGAGTGGATCTTCCAGGCCATAAAGACTCACTCCGAAGCC 
II II II II II II II II I II II II II II II II II MM MM III II II II II II II II II 
ACCAAAGTCAGTGACTTCCGGGAGTGGATCTTCCAGGCCATAAAGACTCACTCCGAAGCC 



MMMM lllll MIIIIMM MM III MM 



III MM I III lllllll 



Mac 

GAAGCC 
II II II 
GAAGCC 

mTn 



II III 



.575 GGACGTTTTTCTTCTTGGGCCCGGTCCACAGGTCCAAGGACACCCT - CCCTCCAGGGTCC 



- -TCTAGGTGCCCCTGATG 



IMIMIII lllllll I II 



TCTTG-TGCTCCTGAAG 



'll 
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>gb | BT029913 . 1 | ^JlS E 
Length=1704 
GENE ID: 508148 HPN I 



3 hepsin (transmembrs 



^transmembrane prote; 



(HPN) , mRNA, 



serine 1) [Bos taurus] 



Strand=Plus/Plus

?? M I I I I I I I I I I I I I I I I I I I 



I 1 1 1 I I III I Ml lllllllllll Ml Mill 

Sbjct 62 -GCACCATCTCTC-A-GCTCCGCCCCCACCTGCCGGACCCCAGGGTCCCGCCCCGGCCCC 118 

Query  208  GGAGGTCAGCCAGGGAATCATTAACAAGAGGCAGTGACATGGCGCAGAAGGAGGGTGGCC  267
            ||||||||||| ||||||||||||| |||||| ||||||||||| |||||||||||||||
Sbjct  119  GGAGGTCAGCCGGGGAATCATTAACTAGAGGCTGTGACATGGCGGAGAAGGAGGGTGGCC  178

Query 268 GGACTGTGCCATGCTGCTCCAGACCCAAGGTGGCAGCTCTCACTGCGGGGACCCTGCTAC 327 

sb :ct i 79 ^1^^ 238 



Sbjct 
Sbjct 
Sbjct 
Sbjct 



I lllllll MINIMI II MMMIMMIMI MIMM lllllllllll I 



AGGAGCCACTGTATCCAGTGCAGGTCGGCCCGGCGGATGCTCGGCTCACGGTGTTCGACC 3 5 8 

AGACGGAAGGG-ACGTGGCGGCTGCTGTGCTCCTCGCGCTCCAACGCCAGGGTAGCCGGA 50 6 
MM II III llllllll Mill I I I I t I [ I I I I I I I I 1 I llllllll II II 

AGACAGA-GGGCACGTGGCGCCTGCTTTGCTCCTCGCGCTCCAATGCCAGGGTGGCGGGG 417 



CTCAGCTGCGA3G \TGG TT 'T^GGGCACTG-ACCCACTCCGAGCTGGACGTGCG 

: II II III MMMIMIMM 

: ITTGGACTT-CTCGGAGCTGGACGTGCG 



CTCAGCTGCGAC 



Query 62 6 
Sbjct 537 
Query 686 



Query 806 



III MM MM llllll II 1 1 1 1 llllllll MM 

i'GCCCGGAGGTTGCTCGAGGTCCTCTCCGTGTGCGACTGTCCCAGAGGCCGTTTCCTGGC 5 9 6 

CGCCATCTGCCAAGACTGTGGCCGCAGGAAGCTGCCCGTGGACCGCATCGTGGGAGGCCG 74 5 

III lllllllllllllllll I I II Mill I! II Mill Mill MM 

TACCAGCTGCCAAGACTGTGGCCACCGAAAACTGCCGGTCGATCGCATTGTGGGCGGCCA 65 6 

GGACACCAGCTTGGGCCGGTGGCCGTGGCAAGTCAGCCTTCGCTATGATGGAGCACACCT 80 5 

GGACACCAGCCTGGGCAGGTGGTC 716 

?i?i????? A i?? c i??i?i? CG ???t?i???i??i?t?tTy??i??t?i??n??? G ?t 



Sbjct 
Sbjct 
Sbjct 



GCGG_-_-_CCGGGTCCTGTCCCGkTGGCGkGTGTTTGCCGGTGCCGTGGCCCkGGCCTCTCC 

I I I I I I I I I I I I I I I II MINIMI INN \\\\\\\\\ I II II 

gcggaaccgggtcctatcacgatggcgagtgtttgctggtgctgtggcccagacttcacc 

ccacggtc - tgcagctgggggtgcaggctgtggtct accacgggggctatcttccctttc 

ccatgg^Igcaa^ 

gggaccccaacagcgaggagaacagcaacgatattgccctggtccacctctccagtc - cc 

GAGACCCCAACAGCGAGGAGAATAGCAATGACATCGCCCT - CACC 



NNNININN lllllllllll INN III 



GATGGCAAGATCTGTACCGTGACGGGCTGGGGCAACACGCAGTACTATGGCCAACAGGCC 



NNNINNIN II INN NINNINN 



1015 GATGGCAAGATCTGCACGGTGACTGGCTGGGGCAACACGCAGTACTACGGCCAACAGGCT 



INN lllllllllll IIIIIIIIIIIIIINII 



INN II IIIIIIIIIIIIIIIIIIIIIINNI 



'GTGTGTGAGGACAGCATCTCTCGGACG 
lllllllllll I I I I I I I I I I I I I I I 

1195 GATGCCTGCCAGGGCGACAGTGGTGGCCCCTTCGTGTGTGAGGATAGCATCTCTCGGACG 



"I I I I I I I I III 



llllllll INI 



I I 
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541697.2 | ^JES PREDICTED: Canis familiaris sin 
: protease, serine 1) (LOC484583), mRNA 



Liar to Serine protease hepsin 



hepsin (transmembrane protease, 



Strand=Plus/Plus



3CTGCGGGGCCACCATGCTCCTGCCCAGGCCTGGA 

I I I I I I I I I I I I I I I I I I I I I I 'MINIM!" I I I I I I I I I I I I I 

CGCCTCTGCCTCCGGGCC-ACCGC--C-GCGGGGCCACCATGCTCCCGCCCAGGCCTGGA 
GACTGACCC-GACCCCGGCACTACCTCGAGGCTCCGCCCCCACCTGCTGGACCCCAGGGT 



Sbjct 114 
Query 2 54 



MMMI Mill lllllllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I 

CCCACCCCGGCCCGGGAGGTCAGCCGGGGAATCATTAACAAGAGGCCGTGACATGGCGGA 17 3 



MMMI Mill 

:ccaccccggccc 

gaaggagggtggccggactgtgccatgctgctccagacccaaggtggcagctctcactgc 313 
lllllllllll MMIMMMMIIMI III IMMMIMMIMMIMIMM 

GAAGGAGGGTGCCCGGACTGTGCCATGCTGTTCCGGACCCAAGGTGGCAGCTCTCACTGC 2 3 3 

GGGGACCCTGCTACTTCTGACAGCCATCGGGGCGGCATCCTGGGCCATTGTG-GCTGTTC 3 72 
I II II II II I 



MM MUM 



MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM 

AGCCGGGGAATCATTAACAAGAGGCCGTGA 
GCCATGCTGCTCCAGACCCAAGGTGGCAGC 

MINIMI III MMMIIININN 

GCCATGCTGTTCCGGACCCAAGGTGGCAG^ 

AGCCATCGGGGCGGCATCCTGGGCCATTG 
II II MMIIIININ 



lllllll II MMMI INI III 



I INI M INI INI III i 

CTCATGGTGTTCGACGACACGG-AGGGCACGTGGCGGCTGCTGTGCTCCTCGCGCTCCAA 



CGCCAGGGTAGCCGGACTCAGCTGCG- - i I I TCAGGGCACTGACCCACTC 




Sbjct 651 CATCGTTGGAGGCCAAGACACCAGCCTGGGCAGGTGGCCGTGGCAAGTCAGTCTTCGCTA 710 

Query 791 TGATGGAGCACACCTCTGTGG-GGGATCCCTGCTCTCCGGGGACTGGGTGCTGACAGCCG 84 9 

Sb.ct 711 CGATGGAGCACACCTCTGTGGAGGG-TCCCTGCTGT 7 S 9 

Query 850 CCCACTGCTTCCCGGAGCGGAACCGGGTCCTGTCCCGATGGCGAGTGTTTGCCGGTGCCG 90 9 
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I II lllllllllll 

'CCCCGAGCGGAACCGGGTCCTGTCTCGGTGGCGAGTGTTTGCCGGCGCCG 82 
TGGCCCAGGCCTCTCCCCACGGTCTGCAGCTGGGGGTGCAGGCTGTGGTCTACCACGGGG 



GCCAC 



^GGCCCTGGTGGACGGCAAGATCTGCACGG' 



ACGGCCAACAGGCTGGGGTGCTCCAG3A33 3 3AACGA 



Sbjct 



Sbjct 
Sbjct 
Sbjct 
Sbjct 



TTCTC 
TTGTC 



■GCTGCGCA-GCCTCCAGGGCCC 1533 



I llllllll II II 



Sbjct 89( 

Query  1030  ACCTCTCCAGTCCCCTGCCCCTCACAGAATACATCCAGCCTGTGTGCCTCCCAGCTGCCG
             |||| ||||| ||||||||||||||||| ||||||||||| ||||||||||| || ||||
Sbjct  950   ACCTGTCCAGCCCCCTGCCCCTCACAGAGTACATCCAGCCCGTGTGCCTCCCGGCGGCCG

Query  1090  GCCAGGCCCTGGTGGATGGCAAGATCTGTACCGTGACGGGCTGGGGCAACACGCAGTACT



I I I I I I I I I I I I I 

CCCCGAGGGCGGCATCG_~T 3 1 , 3"CCCTTCGTGTGTGAGGA 

CAGCATCTCTCGGACGCCACGTTGGCGGCTGTGTGGCATTGTGAGTTGG 
Mill! Mill III 



1388 TGCCCTGGCCCAGAAGCCAGGCGTCTACACCAAAGT d 3 3 3-GTGGATCTT 



1303 TGCCCTGGCCCAGAAGCCAGGTGTCTACACCAAAGTCAGTGACTTCCGGGAGTGGATCTT 
144 8 CCAGGCCATAAAGACTCACTCCGAAGCCAGCGGCATGGTGACCCAGCTCTGACCGGTGGC 



IIIIIIIIMMIIII 



Lf 1 1 1 1 1 J 1 1 1 11 1 1 1 



CCAGGCCATAAAGACTCACTCCGAAGCCAGCGGCATGGTGACCCAGCTCTGACCTGTGGC 



145- 



>ref|XM_001157575.1| PREDICTED: Pan troglodytes hepsin (transmembrane protease, serine 1), transcript variant 2 (HPN), mRNA






Query 25: 
Sbjct 



GGTCCCACCCTGGCCCAGGAGGTCAGCCAGGGAATCATTAACAAGAGGCAGTGACATGGC 2 5 0 
IMIIMIIMIIMIMIIMIIMIIMIIMIIIIIIIIIIIMMIIMIIMIII 

GGTCCCACCCTGGCCCAGGAGGTCAGCCAGGGAATCATTAACAAGAGGCAGTGACATGGC 2 0 5 

GCAGAAGGAGGGTGGCCGGACTGTGCCATGCTGCTCCAGACCCAAGGTGGCAGCTCTCAC 310 

GCAGAAGGAGGGTGGCCGGACTGTGCCATGCTG 2 S 5 

TGCGGGGACCCTGCTACTTCTGACAGCCATCGGGGCGGCATCCTGGGCCATTGTGGCTGT 3 7 0 

IgCGGGGACCC^^ 325 

TCTCCTCAGGAGTGACCAGGAGCCGCTGTACCCAGTGCAGGTCAGCTCTGCGGACGCTCG 4 3 0 

IcicclcAGGAG^ 385 

GCTCATGGTCTTTGACAAGACGGAAGGGACGTGGCGGCTGCTGTGCTCCTCGCGCTCCAA 4 9 0 

: MM MM MM III IM: 

GCTCATGGTCTTTGACAAGACGGAAG 3ACGT |TTTT,,T--445 



CGAGCTGGACGTGCGAACGGCGGGCGCCkkTGGCkCGTCG33CTTCTTCTGTGTGGACGA 610 

CGAGCTGGACGTGCGAACGGCGG^ 5 S 5 

GGGGAGGCTGCCCCACACCCAGAGGCTGCTGGAGGTCA 62 5 

TTT??TTTT?TTTT??T??TT?TT??TTTT?T?TTf??T?TTfTTT?T?????TTTT??T 
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AGGCCGTTTCTTGGCCGCCATCTGCCAAGACTGTGGCCGCAGGAAGCTGCCCGTGGACCG 



Sbjct 
Sbjct 
Sbjct 
Sbjct 
Sbjct 
Sbjct 



CATCGTGGGAGGCCGGGACACCAGCTTGGGCCGOT 

TGATGGAGCACACCTCTGTGGGGGATCCCTGCTCTCCGGGGACTGGGTGCTGACAGCCGC 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

TGATGGAGCACACCTCTGTGGGGGATCCCTGCTCTCCGGGGACTGGGTGCTGACAGCCGC 




CCTCTCCAGTCCCCTGCCCCTCACAGAATACAT" - rGTGT T "CCAGCTGCCGG 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii:::: 

CCTCTCCAGTCCCCTGCCCCTCACAGAATACATCCAGCCTGTGTGCCTCCCAGCTGCCGG 
CCAGGCCCTGGTGGATGGCAAGATCTGTACCGTGACGGGCTGGGGCAACACGCAGTACTA 

: ' I ' I I I I I I I , I < : 

CCAGGCCCTGGTGGATGGCAAGATCTGTACCGTGACGGGCTGGGGCAACACGCAGTACTA 
TGGCCAACAGGCCGGGGTACTCCAGGAGGCTCGAGTCCCCATAATCAGCAATGATGT 

II MM MM III III' 

TGGCCAACAGGCCGGGGTACTCCAGGAGGCTCGAGTCCCCATAATCAGCAATGATGT 



Score = 968 bits (524), Expect = 0.0 
Identities = 528/530 (99%), Gaps = 0/530 (0%)
Strand=Plus/Plus 



Sbjct 



Sbjct 



TTCTGTGCTGGCTACCCCGAGGGTGGCATTGATGCCTGCCAGGGCGACAGCGGTGGTCCC 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

TTCTGTGCTGGCTACCCCGAGGGTGGCATTGATGCCTGCCAGGGCGACAGCGGTGGTCCC 
TTTGTGTGTGAGGACAGCATCTCTCGGACGCCACGTTGGCGGCTGTGTGGCATTGTGAGT 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

~ ■ ■ ~ TTGGCGGCTGTGTGGCATTGTGAGT 



tttgtgtgtgaggacagcatctcto3gao3o:a>:gttg: 



1464 
1614 
1524 
1674 
1584 
1734 
1644 



TGGGGCACT 3 5CTG1 " , - SAAGCCAGGCGTCTACACCAAAGTCAGTGACTTC 

TGGGGCACTGGCTGTGCCCTGGCCC 



CGGGAGTGGATCTTCCAGGCCATAAAGACTCACTCCGAAGCCAGCGGCATGGTGACCCAG 
ATCCACGCTGGGCCGAGGATGGGACGTTTTTCTTCTTGGGCCCGGTCCACAGGTCCAAGG 

MMMMMMM M M M M M M M M M M M M M M M M M M M M M M I 

ATCCACGCTGGGCCTAGGATGGGACGTTTTTCTTCTTGGGCCCGGTCCACAGGTCCAAGG 
ACACCCTCCCTCCAGGGTCCTCTCTTCCACAGTGGCGGGCCCACTCAGCCCCGAGACCAC 

IMMIIMIIMIIMMMIMM 



GGTGCCCCTGATGATGGGATGCTCTTTAAATAATAAAGATGGTTTTGATT 1783 

MMMMMMM M M M M M M M M M M M M M M M M M I 

GGTGCCCCTGATGACGGGATGCTCTTTAAATAATAAAGATGGTTTTGATT 1693 



>emb|CR597177.1| full-length cDNA clone CS0DJ003YL08 of T cells (Jurkat cell line) Cot 10-normalized of Homo sapiens (human)
Length=1828



serine 1) [Homo sapiens] 






CAGGGCACTGACCCACTCCGAGCTG 4 3 4 
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sbjct 495 mi^MMMM^^ 

Query 713 GAAGCTGCCCGTGGACCGCATCGTGGGAGGCCGGSACACCASCTTGGGCCGGTGGCCGTG 



I I I I I I I I I I I I I I I I 

555 GAAGCTGCCCGTGGACCGCATCGTGGGAGGCCGGGACACCAGCTTGGGCCGGTGGCCGTG 6 



Query 773 

Sbjct 615 

Query 833 

Sbjct 675 

Query 893 

Sbjct 735 



GCAAGTCAGCCTTCGCT.-.- ! 2 



GCAAGTCAGCCTTCGCTATGATGGAGC~C~"T T ^333,- 674 



Sbjct 
Sbjct 
Sbjct 



llllllllllllllll 



AGTGTTTGCCGGTGCCGTGGCCCAGGCCTCTCCCCACGGTCTGCAGCTGGGGGTGCAGGC 
1 5 3 TGTGGTCTACCACGGGGGCTATCTTCCCTTTCGGGACCCCAACAGCGAGGAGAACAGCAA 101 

III MM MM III III 

9 5 TGTGGTCTACCACGGGGGCTATCTTCCCTTTCGGGACCCCAACAGCGAGGAGAACAGCAA 854 

.013 CGATATTGCCCTGGTCCACCTCTCCAGTCCCCTGCCCCTCACAGAATACATCCAGCCTGT 107 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I 
; 5 5 CGATATTGCCCTGGTCCACCTCTCCAGTCCCCTGCCCCTCACAGAATACATCCAGCCTGT 914 



IgccggccaggcccU 
gggcaacacgcagtactatggccaacaggccggggtactccaggaggctcgagtccccat 

GGGCAACACGCAGTACTATGG 
Query 1193 AATCAGCAATGATGTCTGCAATGGCGCTGACTTCTATGGAAACCAGATCAAGCCCAAGAT 

sb : ct 1035 aa!cagcaa!ga!g!^ 



1034 
1252 



Sbjct  1155  CTTTGTGTGTGAGGACAGCATCTCTCGGACGCCACGTTGGCGGCTGTGTGGCATTGTGAG

Query  1373  TTGGGGCACTGGCTGTGCCCTGGCCCAGAAGCCAGGCGTCTACACCAAAGTCAGTGACTT

Sbjct  1215  TTGGGGCACTGGCTGTGCCCTGGCCCAGAAGCC

Query 1433 CCGGGAGTGGATCTTCCAGGCCATAAAG 1460 

M M M M M M M M M M M M M M 

Sbjct 1275 CCGG T TT 3 3C CATAAAG 1302 



Score = 566 bits 0 
Identities = 310/31: 
Strand=Plus/Plus 



Query 14 5 9 
Sbjct 1517 



(99%) , Gaps = C 



AGACTCACTCCGAAGCCAG" , I ± , T , TTCTCGCTGCG 




MMMIIIIIIIIIMIII^ 



TTAAATAATAAA 177 0 
M M M M M M 
TTAAATAATAAA 182 8 



Strand=Plus/Plus 
Query 86 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [[ M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 
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CAGGAGGTCAGCCAGGGAATCATT 



Strand=Plus/Plus 

Query  405  GTGCAGGTCAGCTCTGCGGACGCTCGGCTCATGGTCTTTGACAAGACGGAAGGGACGTGG  464
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  176  GTGCAGGTCAGCTCTGCGGACGCTCGGCTCATGGTCTTTGACAAGACGGAAGGGACGTGG  235

Query 465 CGGCTGCTGTGCTCCTCGCGCTCCAACGCCAGGGTAGCCGGACTCAGCTGCGAGGAGATG 524 

Sbjct 236 CGGCTGCTGTGCTCCTCGCGCT 2 95 
Query  525  GGCTTCCTCAGG  536
            ||||||||||||
Sbjct  296  GGCTTCCTCAGG  307



>emb|CR592189.1 | i 

(human) 
Length=1212 



i full-length cDNA clor 



5 of Fetal liver of Homo sapiei 



[Homo sapiens] 



Strand=Plus/Plus






GCTTCC-CGGAGCGG^J-.CCGGGTCCTGTCCCGATGGCGAGTGTTTGCCGGTGCCGTGGCC 

MINI I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GCTTCCTCAG-GCGGAACCGGGTCCTGTCCCGATGGCGAGTGTTTGCCGGTGCCGTGGCC 
CAGGCCTCTCCCCACGGTCTGCAGCTGGGGGTGCAGGCTGTGGTCTACCACGGGGGCTAT 

' 1 1 1 1 : 1 1 ' : ' ' 



Sbjct 
Sbjct 
Sbjct 
Sbjct 
Sbjct 



GCCCTGGTGGATGGCAAGATCTGTACCGTGACGGGCTGGGGCAACACGCAGTACTATGGC 1154 

GCCCTGGTGGATGGCAAGATCTGTACCGTG 596 

CAACAGGCCGGGGTACTCCAGGAGGCTCGAGTCCCCATAATCAGCAATGATGTCTGCAAT 1214 
IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMIIIIIIIIMIIIIIIIIIIIIIII 

CAACAGGCCGGGGTACTCCAGGAGGCTCGAGTCCCCATAATCAGCAATGATGTCTGCAAT 65 6 

GGCGCTGACTTCTATGGAAACCAGATCAAGCCCAAGATGTTCTGTGCTGGCTACCCCGAG 12 74 
' I ' I I I I I I I I I ! i 

GGCGCTGACTTCTATGGAAACCAGATCAAGCCCAAGATGTTCTGTGCTGGCTACCCCGAG 716 

fGTGTGTGAGGACAGCATC 13 34 



12 75 GGTGGCATTGATGCCTGCCAGGGCGACAGCGGTGGTCi 



MMMMMMM™ 776 

1335 TCTCGGACGCCACGTTGGCGGCTGTGTGGCATTGTGAGTTGGGGCACTGGCTGTGCCCTG 13 94 

777 TCTCGGACGCC^ S36 

13 95 GCCCAGAAGCCAGGCGTCTACACCAAAGTCAGTGACTTCCGGGAGTGGATCTTCCAGGCC 14 54 



GCCCAGAAGCCAG^ 



Query 1455 

Sbjct 8 97 ATAAAGACTCACTCCGAAGCCAGCGGCATGGTGACCCAGCT 95 6 

Query 1515 TGCGCAGCCTCCAGGGCCCGAGGTGATCCCGGTGGTGGGATCCACGCTGGGCCGAGGATG 

Sbjct 957 4L^^ ( L ( LJ1111111J1111J111I1JI1JLL111J111J11JI1111LJ11J11 

Query 1575 GGAC 



rCTTCTTGGGCCCGGTCCACAGGTCCAAGGACACCCTCCCTCCAGGGTCCT 1634 



MIIIIIIIIIIIIIIIIIIIMM 



CTCTTCCACAGTGGCGGGCCCACTCAGCCCCGAG 

??TTTTTTTTTTTTTT?T??TTT?T?T?T?T??T?T?TTTTTT????TTTTTT T TTTTT? 
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CCATGTAAATATTGTTCTGCTGTCTGGGACTCCTGTCTAGGTGCCCCTGATGACGGGATG 



CT CTTT AAAT AATAAA 1212 



Strand=Plus/Plus 

Query  85  TCCAGGCCGCCCGCTGCTGCGGGGCCACCATGCTCCTGCCCAGGCCTGGAGACTGACCCG
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  1   TCCAGGCCGCCCGCTGCTGCGGGGCCACCATGCTCCTGCCCAGGCCTGGAGACTGACCCG

Query  145  ACCCCGGCACTACCTCGAGGCTCCGCCCCCACCTGCTGGACCCCAGGGTCCCACCCTGGC



Sbjct 



CCAGGAGGTCAGCCAGGGAATCATTAACAA3.-3 , " 1 - 3AAGGAGG 



Identities 
Strand=Plu 



Sbjct 



2 3 7 CGGCTGCTGTGCTCCTCGCGCTCCAACGCCAGGGTAGCCGGACTCAGCTGCGAGGAGATG 2 96 
52 5 GGCTTCCTCAGG 536 



GGcllcclcAGG 



>emb|CU693029.1 | Synthetic 
3 ' read HPN mRNA 
Length=1184 



construct Homo sapiei 



Strand=Plus/Minus



GGGC - CATTGTGGCTGTTCTCCTCAGGAGTGAC - CAGGAGCCGCTG - TACCCAGTGCAGG 

MM Ml MM I 1 1 I Ml MM I I MMMM Ml II MM I III 

GGGCAC ATGGTGGGT -TTTTT CTCGGGAG - GGCACAGGAGCC - CTGTTA - CCAGGGGAGG 



TCAGCTCT ; 



C - GGCTCATGGTCTTTGACAAGACGGAAGGGACGT - GGCGGC 

I III II III II MINIM lllllllllll I II II 

■ GCCTCGGGTTCA - AGTTTTTGACAAAACGGAAGGGAC - TGGGGGGT 
CGCTCCAACGCCAGGGTAGCCGGACTCAGCTGCGAGGAGATGGGCT 
iG! T ciicCCci G iU:U,lU_!4.LU G UGGii!-GGc! 
TCCTCAGGGCACTGACCCACTCCGAGCTGGACGTGCGAACGGCGGGCGCCAATGGCACGT 

CGGGCTTCTTCTGTGTGGACGAGGGGAGGCTGCCCCACACCCAGAGGCTGCTGGAGGTCA 



Ml I 

TGGGG1 



ITTTTTCTG-GGGGACGAGGGAAGGCT - I :■: : - 3AC rCAGAGGCGGCTGGA 



Query 649 TCTCCGTGTGTG - ATTGCCCCAGAGGCCGTTT Tl -T 'T 3CCAAGACTGTGGC 707 

III II I I I II MM MMMM II MMMMMMMMMMMIMM 

Sbjct 866 TCTTCG-GGGGGAATGGCCCGAGAGGCCGGTTGTTGGCCGCCATCTGCCAAGACTGTGGC 808 

Query 708 CGCAGGAAGCTGCCCGTGGACCGCATCGTGGGAGGCCGGGACACCAGCTTGGGCCGGTGG 767 

Sbjct 807 CGCAGGAAGCTGCCCGTGG 748 

Query 768 CCGTGGCAAGTCAGCCTTCGCTATGATGGAGCACACCTCTGTGGGGGATCCCTGCTCTCC 82 7 

Sbjct 747 UWkLUUL^^ 688 



TGGCGAGTGTTTGCCGGTGCCGTGGC 3 3 1 l 3 -3CTGGGGGTG 

MMMMMMMMMMMMMMMMIMIIIIIIIIIMMMMIMIMM 

TGGCGAGTGTTTGCCGGTGCCGTGGCCCAGGCCTCTCCCCACGGTCTGCAGCTGGGGGTG 

?TTT?TTTTTT?TT??T?T?TTT?TTT?TT???TTT?TTTT????TT?TT??TT?T?TT? 



Sbjct 



Query 1008 AGCAACGATJ-TTS" T r AATACATCCAG 

Sbjct 507 AGCAACGa!a!!gCC^ 

Que ry 1068 CCTGTGTGCCTCCCAGCTGCCGGCCAGGCCCTGGTGGATGGCAAGATCTGTACCGTGACG 
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Sbjct 



GGCTGGGGCAACACGCAGTACTATGGCCAACAGGCCGGGGTACTCCAGGAGGCTCGAGTC 

iiiiiiiiij j m iiij in iijiiiiij j ii^by^jj iiiiJiiiijm.il , ' ' ' 



GGCTGGGGCAACACGCAGTACTATGGCCAACAGGC 



3GTACTCCAGGAGGCTCGAGTC 



IM Mil MM III M 



AAGATGTTCTGTGCTGGCTACCCCGA3G 3 1 i 3 TTGA1 TG 3 33CGACAGCGGT 

M M M M M M M M M M M M M M M M I MM MM MM M M M M M M M I 

AAGATGTTCTGTGCTGGCT-r— !GAG 1 I ATTGA1 i 1 3 7 3- ?~ 3 "GGT 



Query 1308 



Sbjct 



GTGAGTTGGGGCACTGGCTGTGCC " 33GTCTACACCAAAGTCAGT 88 



GACTTCCGGGAGTGGATCTTCCAGGCCATAAAGA' I - 'T 'CGAAGCCAGCGGCATGGTG 28 



ACCCAGCTCTG 14 98 
M M M M M I 
ACCCAGCTCTG 17 



>ref|XM_001157514.1| PREDICTED: Pan troglodytes hepsin (transmembrane protease, serine 1), transcript variant 1 (HPN), mRNA

Length=2104 






Score = 1476 t 
Identities = £ 
Strand=Plus/Plus



Sbjct 



Query 52 3 

Sbjct 887 

Query 583 

Sbjct 947 

Query 64 3 

Sbjct 1007 

Query 703 

Sbjct 1067 

Query 763 



CAGTGCAGGTCAGCTCTGCGGACGCTCGGCTC&TGGTCTTTGACAAGACGGAAGGGACGT 82 6 

mmmImImmimimmmm 2 

TGGGCTTCCTCAGGGCACTGACCCACTCCGAGCTGGACGTGCGAACGGCGGGCGCCAATG 582 

TGGGCTTCCTCAGG^ 946 



rTCTTCTC 



I MMMMMMMMM1MMMMMMMMMMI 



AGGTCATCTCCGTGTGTGATTGCCCCAGAGGCCGTTTCTTGGCCGCCATCTGCCAAGACT 

MMMMMMMMMMMMMMMMIMMMMMMMMMMMMMI 

AGGTCATCTCCGTGTGTGATTGCCCCAGAGGCCGTTTCTTGGCCGCCATCTGCCAAGACT 
GTGGCCGCAGGAAGCTGCCCGTGGACCGCATCGTGGGAGGCCGGGACACCAGCTTGGGCC 

M M M M M M M M M M M M M "' 



iCCGCATCGTGGGAGGCCGGGACACCAGCTTGGGCC 



GGTGGCCGTGGCAAGTCAG ? T 1 r ,33GATCCCTGC 

GGTGGCCGTGG^ 

TCTCCGGGGACTGGGTGCTGACAGCCGCCCACTGCTTCCCGGAGCGGAACCGGGTCCTGT 
TCTCCGGGGACTGGG^^ 

CCCGATGGCGAGTGTTTGCCGGTGCCGTGGCCCAGGCCTCTCCCCACGGTCTGCAGCTGG 

MMMMMMMMMMMMMMMMIMMMMMMMM MMMMM 

CCCGATGGCGAGTGTTTGCCGGTGC3 3T i 1 1 i T ™CACGGCCTGCAGCTGG 



1307 
1003 



AGAACAGCAACGATATTGCCCTGGTCCACCTC1 : ' :TCACAGAATACA 1062 

AGAACAGCAAC^^ 1426 



TCCAGCCTGTGTGCCTCCCAGCTGCCGGCCAGG 



1486 
1182 



»://blast.ncbi.nlm.nih.j 



5/20/2008 






TGACGGGCTGGGGCAACACGCAGTACTATGGCCAACAGGCCGGGGTACTCCAGGAGGCTC 



gagtcccca: 



Strand=Plus/ 
Query 1254 
Sbjct 1573 
Query 1314 



TTCTGTGCTGGCTACCCCGAGGGTGGCkTTGkTGCCTGCCkGGGCGkCAGCGGTGGTCCC 

' 1 1 1 1 1 1 1 1 1 1 1 1 1 > I : 



Sbjct 1993 
Query 1734 



^"readM 



'AATCAGCAATGATGT 



"Ml 



TTTGTGTGTGAGGACAGCATCTCTCGGACGCC-. ? V :- :ATTGTGAGT 

Sbj ct 163 3 TTTGTGTGTGAGGACAGCATCTC^ 
Query  1374  TGGGGCACTGGCTGTGCCCTGGCCCAGAAGCCAGGCGTCTACACCAAAGTCAGTGACTTC
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  1693  TGGGGCACTGGCTGTGCCCTGGCCCAGAAGCCAGGCGTCTACACCAAAGTCAGTGACTTC
Query  1434  CGGGAGTGGATCTTCCAGGCCATAAAGACTCACTCCGA
Sbjct 17! 

Query  1494  CTCTGACCGGTGGCTTCTCGCTGCGCAGCCTCCAGGGCCCGAGGTGATCCCGGTGGTGGG
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  1813  CTCTGACCGGTGGCTTCTCGCTGCGCAGCCTCCAGGGCCCGAGGTGATCCCGGTGGTGGG

Query  1554  ATCCACGCTGGGCCGAGGATGGGACGTTTTTCTTCTTGGGCCCGGTCCACAGGTCCAAGG



'CTTCTTGGGCCCGGTCCACAGGTCCAAGG 




GGT _ _ - - 1^83 



Synthetic construct Homo sapiej 



2102 
IMAGE : 1 



Query 305 



Sbjct 136 

Query 425 

Sbjct 196 

Query 485 

Sbjct 256 

Query 54 5 

Sbjct 316 

Query 605 

Sbjct 436 

Query 72 5 

Sbjct 496 

Query 785 



Score = 1465 bits (793), Expect = 0.0 
Identities = 972/1052 (92%), Gaps = 38/1052 (3%) 
Strand=Plus/Plus 
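
The percentages on the Identities/Gaps line can be checked against the raw counts. The sketch below is not part of the BLAST output: `parse_hsp_stats` and the regular expression are hypothetical helpers written for this illustration, and the use of integer truncation is an assumption inferred from the figures in this report (for example, 528/530 is shown as 99%).

```python
import re

# Hypothetical pattern for the "Identities = a/b (p%), Gaps = c/d (q%)" line
# as it appears in this report; optional spaces before the comma are allowed.
HSP_STATS = re.compile(
    r"Identities\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)\s*,"
    r"\s*Gaps\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)"
)

def parse_hsp_stats(line):
    """Return counts and recomputed percentages, or None if the line does not match."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, length, ident_pct, gaps, gap_len, gap_pct = map(int, m.groups())
    return {
        "identities": ident,
        "align_length": length,
        "gaps": gaps,
        "identity_pct_reported": ident_pct,
        "gap_pct_reported": gap_pct,
        # Assumption: the report truncates rather than rounds.
        "identity_pct_floor": ident * 100 // length,
        "gap_pct_floor": gaps * 100 // gap_len,
    }

stats = parse_hsp_stats("Identities = 972/1052 (92%), Gaps = 38/1052 (3%)")
print(stats)
```

For the alignment above, both recomputed values agree with the reported 92% and 3%.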

Query  245  CATGGCGCAGAAGGAGGGTGGCCGGACTGTGCCATGCTGCTCCAGACCCAAGGTGGCAGC
Sbjct 16 CATGGCGCAGAAGGAGGGTG^ 



TCTCACTGCGGGGACCCTGCTACTTCTGACAGCCATCGGGGCGGCATCCTGGGCCATTGT 13 5 

CGCTCGGCTCATGGTCTTTGACAAGACGGAAGGGACGTGGCGGCTGCTGTGCTCCTCGCG 4 84 

OGcIoGGcIcaIg^ 255 

CTCCAACGCCAGGGTAGCCGGACTCAGCTGCGAGGAGATGGGCTTCCTCAGGGCACTGAC 54 4 

CTCCAACGCCAGGGTAGCCGGACT 315 



MMMMMIMM 



GAGGCCGTTTCTTGGCCGCCATCT 
GGACCGCATCGTGGGAGGCCGGGACACCAGCTTGGGCCGGTGGCCGTGGCAAGTCAGCCT 
GGACCGCATCGTGGGAGGCCGGGACACCAGCT 

T?T?TtTTTT?TTT?t?T??T?TTTT?T?TTT???T??T?T???T??T?T?T?TT?TTT? 



Query 84 5 
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TGCCGTGGCI 



CACGGGGG-CTATCTTCCCTTT-CGGGACCCCAACAGCGAGGAGAACAGCAACGA 



Sbjct 734 C - CGGGGGGCT - 
Query 1020 GCC-CTGGTCCACCTCTCCAGT- 



-AATAAATCCAGCCTGGTGT 84 
CCCAGCT-GCCGGCCAGGCCCTGGTGGA-TGGCAAGATCTGT-ACCGTGACGGGC 11. 



I I ! 

AG-CCTC 



ACCCGCGAGGAGAACAGCAACGAATATT 
-GCCCCT- CACAGAATACATCCAGCCTG 



CCGGGGACA-CACCCAGTACTTTGCGC-AACCGGCCGCGGGTATTCCCGGAGGTTCTAGG 9 64 

TCCCC - ATAATCAGCAA - TGA - TGTCTGCAATGGCGCTGACTTCTATGGAAAC - CAGATC 1241 
II II Mill I III I I I III llllllllll II III I I III I I I I I 

TCTCCTATAATAA - CAAATTAATTTCTCCAATGGCGCTTACCTCTCTTGTAACTCCGATC 102 3 



?GGTCTAC 



1024 TGACCCCAACA-GTTTTTTGCTTGTCTACCCC 



II OS Homo sapier 
to SERINE PROTEASE HEPSIN (EC 3.4 
Length=2175 






Strand=Plus/Plus



Ouerv 6 0 5 c ck-kGkcTGTGGc CGCAGGAAGCTGC ■? C GTGGAC c GCATCGTGGGAGG': CGGGACAC CA 7 5 3 
III 

Sbjct 870 CCACAGACTGTGGCCGCAGGAAGCTGCCCGTGGACCGCATCGTGGGAGGCCGGGACACCA 92 9 

Query 754 GCTTGGGCCGGTGGCCGTGGCAAGTCAGCCTTCGCTATGATGGAGCACACCTCTGTGGGG 813 

Sb : ct ,30 GCTTGGGCCGGTGGCCGTG 989 



Query 874 GGGTCCTGTCCCGATGGCGAGTGTTTGC , i 1 - IT CCCACGGTC 

Sbj- ct 1050 GGGTCCTGTCCCGATGGCGAGTGTTTGCCGGTGCCGTGGCCCAGGCCTCTCCCCACGGTC 

Query 934 TGCAGCTGGGGGTGCAGGCTGTGGTCTACCACGGGGGCTATCTTCCCTTTCGGGACCCCA 

Sbj ct 1110 TGCAGCTGGGGGTGCAGGCT 

Query 9 94 ACAGCGAGGAGAACAGCAACGATATTGCCCTGGTCCACCTCTCCAGTCCCCTGCCCCTCA 

SbjCt 1170 ACAGCgU^ 

Query 1054 

Sbj ct 12 3 0 CAGAATACATCCAGCCTGTGTGCCTCCCAGCTGCCGGC 

Que ry 1114 TCTGTACCGTGACGGGCTGGGGCAACACGCAGTACTATGGCCAACAGGCCGGGGTACTCC 

MINI MM III II 

Sbjct 12 90 TCTGTACCGTGACGGGCTGGGGCAACA . A cAGGCCGGGGTACTCC 



1350 
1234 



aggaggcIcgagIcc^ 

accagatcaagcccaagatgttctgtgctggctaccccgagggtggcattgatgcctgcc 
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Strand=Plus/pr 
Query 14 5 9 
Sbjct 1851 



AGACTCACTCCGAJ-GCCAGCGGCATGGTGACCCAGCTCTGACCGGTGGCTTCTCGCTGCG 

Mi MM MM Ml < 

AGACTCACTCCGAAGCCAGCGGCATGGTGA "C : -.3 -'I T " J' !!• IGGTTCTCGCTGCG 



1519 CAGCCTCCAGGGCCCGAGGTGATCCCGGTGGTGGGATCCACGCTGGGCCGAGGATGGGAC 



lllllllll 




TCCACAGTGGCGGGCCCACTCAGCCCCGAGACCACCCAACCTCACCCTCCTGACCCCCAT 



2 031 TCCACAGTGGCGGGCCCACTCAGCCCCGAGACCACCCAACCTCACCCTCCTGACCCCCAT 2 1 
16 9 9 GTAAATATTGTTCTGCTGTCTGGGACTCCTGTCTAGGTGCCCCTGATGATGGGATGCTCT 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M i 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 

2091 GTAAATATTGTTCTGCTGTCTGGGACTCCTGT-T- , IT 



TTAAATAATAAAGATGGTTTTGATT 17 83 



Sbjct 



MMMI 
GACGCTCC 

CGCTCCAi 
MMMI 



Score = 233 bits (12 
Strand=Plus/Plus 



CAGGGCACTGACCCACTO 



M M M M M M M 



M M M M I 

TCCACAGT" 

GTAAATAT 
M M M M 
GTAAATAT 

TTAAATAA 

MMMM 

TTAAATAATAAAGATGGTTTTGATT 2175 



MMMMMMMMM 



1910 
1578 



.TGGGATGCTCT 



GACGCTCGGCTCATGGTCTTTGACAAGACGGAAGGGACGTGGCGGCTGCTGTGCTCCTCG 



M M M M M M M 



GGGTGGAAGGGGAGGGTAGGGGGAGTGAGGTGGGAGGA' 



MMIMMMMMMMMMMI 



Score = 322 bits (174 
Identities = 174/174 " 
Strand=Plus/Plus 

Sbjct 

Query 423 

Sbjct 219 GACGCTCGGCTCATGGTCTTTGAcMGACGGMGGGACGTGGC 2 78 

Query 483 CGCTCCAACGCCAGGGTAGCCGGACTCAGCTC 536 



MMMMMMMMMMMMM 



AGGAGATGGGCTTCCTCAGG 3 3 : 



'CGAGCTGGACGTGCGAACGGCGGGCGCCAJ./TGG'rA'rGT'I'GGG 



52 0 CGTGTGGTGA 52 9 



Identities 
Strand=Plu: 



AGACCCAAGGTGGCAGCTCTCACTGCGGGGACCCTGCTACTTCTGACAGCCATCGGGGCG 60 
GCATCCTGGGCCATTG 3 63 

I Ml I Ml 1 1 1 1 1 Ml 76 



Identities 
Strand=Plu: 



Query 658 GTGATTGCCCCAG~ 3 4 rTTCTl , ' -T'T3:CAAG 699 

Sb.ct 657 M^^L^m^^^A 698 



>ref|XM_001093460.1| PREDICTED: Macaca mulatta hepsin (transmembrane protease, serine 1), transcript variant 1 (HPN), mRNA
Length=2174
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III M MM MM III !l 



Query 814 GATCCCTGCTCTCCGGGGACTGGGTGCTGACAGCCGCCCACTGCTTCCCGGAGCGGAACC 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 987 GGTCCCTGCTCTCCGGGGACTGGGTGCTGACAGCTGCCCACTGCTTCCCGGAGCGGAACC

Query 874 GGGTCCTGTCCCGATGGCGAGTGTTTGCCGGTGCCGTGGCCCAGGCCTCTCCCCACGGTC 
IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIII I 
Sbjct 1047 GGGTCCTGTCCCGATGGCGAGTGTTTGCCGGTGCCGTGGCCCAGGCCTCTCCCCACGGCC

Query 934 TGCAGCTGGGGGTGCAGGCTGTGGTCTACCACGGGGGCTATCTTCCCTTTCGGGACCCCA 

MIIMI M 

™ TCGGGACCCCA 

Query 1054 CAGAATACATCCAGCCTGTGTGCCTCCCAGCTGCCGGCCAGGCCCTGGTGGATGGCAAGA 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I MM III MM II 
Sbjct 1227 CAGAATACATCCAGCCTGTGTGCCTCCCAGCTGCTGGCCAGGCCCTGGTGGATGGCAAGA

Query 1114 TCTGTACCGTGACGGGCTGGGGCAACACGCAGTACTATGGCCAACAGGCCGGGGTACTCC

IMMMMIM 

Sb]Ct 1287 TCTGTACCGTGACGGGCTGGGGCAAC~ 3CAGTA I 1 3 I AA CAGGCCGGGGTACTCC 



Sbjct 



AGGAGGCTCGAGTCCCCATAATCAGCAATGATGTCTGCAATGGCGCTGACTTCTATGGAA 
AGGGCGACAGCC 



3GTGGTCCCTTTGTGTGTGAGGACAGCATCTCTCGGACGCCACGTTGGC 
^ L vGGTGACAGCGGTGGTCCC 



J 1 3 TCTTCCAGGCCATAAAG 1460 

lllllllllllllllllllllllllllllllllllllllllllllll 

ACACCAAAGTC^ 11 - ]T T it -T-AAG 163 3 



Identities = 316/325
Strand=Plus/Plus 



Query 175 9 TTAr-_- i i ' tSTTTTGATT 

sb.ct 2148 !!iii!ii!iiiGi!GG!!!!Gi!! 



iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiniiiiiiiiiiiiiiiiiiiii 



cagcctccagggcc^^ 



1699 GTAAATATTGTTCTGCTGTCTGGGACT r 1 I T 3 -T33GATGCTCT 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I > 1 1 1 M I M 1 1 1 i 1 1 llllllllll 

2 088 GTAAATATTGTTCTGCTGTCTGGGACT 7 3ATGACGGGATGCTCT 



Score = 300 bits (162), Expect = 2e-77 
Identities = 170/174 (97%), Gaps = 0/174 (0%) 
Strand=Plus/Plus 



ll MMMMM 



Ml llllllll 
CGCTCCAACGCCAGGGTAGCCGGACTCAGCTGCGAGGAGATGGGCTTCCTCAGG 536

lllllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CGCTCCAACACCAGGGTAGCCGGACTCAGCTGCGAGGAGATGGGCTTCCTCAGG 331



Strand=Plus/Plus 



Sbjct 399 CAGGGCACTGACCCACTCCGAGTTGGACGTGCGAACGGCGGGCGCCAACGGCACGTCAGG




Strand=Plus/Plus 



iACCCTGCTACTTCTGACAGCCATO" 



GACCCAAGGTGGCAGCTCTCACTGCG3J - TGCTACTTCTGi - i "ATCGGGGCGG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Mill"" 
GACCCAAGGTGGCAGCTCTCACTGCAGGGACC — " 

CATCCTGGGCCATTG 363
|||||||||||||||
CATCCTGGGCCATTG 75



3 | [13 



Mus musculus hepsin (Hpn), transcript variant 2, mRNA



GENE ID: 15451 Hpn | hepsin [Mus musculus] (Over 10 PubMed links) 

Score = 1303 bits (705), Expect = 0.0 
Identities = 1343/1644 (81%), Gaps = 71/1644 (4%)
Strand=Plus/Plus 

Query 184 ACCCCAGGGTCCCACCCTGGCCCAGGAGGTCAGCCAGGGAATCATTAACAAGAGGCAGTG 243



llllllllll II III II 



148 ACCCCAGGGTTCCGCCCCAGCCCAACAGGTCAACCTGGGAATCATTAACAAGAGTCCCTG 207



Query 244 
Sbjct 208 
Query 303 



ACATGGCGCAGAAGGAGGGTGGCCGGACTG-TGCCATGCTGCTCCAGACCCAAGGTGGCA 302



I I I I I II lllllllllll 



ACAT - G - GC - GAAGGAGGGTGGCCGGACTGCAG - CATGCTGCTCCAGACCCAAGGTGGCA 2 53 
GCTCTCACTGCGGGGACCCTGCTACTTC-TGACAGCCATCGGGGCGGCATCCTGGGCCAT 361



lllllll II I 
GCTCTCATTGTC 



TGTGACCATCCTACTGCAG-, 



I I I I I I Mill III 



1 1 MMM 1 1 MMMMMMMMM I 1 1 



MMMM I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



I I I I I I I I I I i ! i I I I I I I I I I I I I I 

ITGGGTACCCTGCTG-TTCCTGACAGGCATTGGGGCCGCGTCCTGGGf 



ICCAT 



CAGGAGTGACCAGGAGCCGCTGTACCCAGTGCAGGTCAGCTCT - 



Query 420 GCGGACGCTCGGCTCATGGTCTTTGACAAGACGGAAGGGA-CGTGGCGGCTGCTGTGCTC 478



M MM Mill MM MMMM 



GGGGACTCACGGCTTGCGGTGTTTGACAAGACGGA-GGGAACGTGGAGGCTACTGTGCTC
CTCGCGCTCCAACGCCAGGGTAGCCGGACTCAGCTGCGAGGAGATGGGCTTCCTCAGGGC 

III MMMM MMMM II II III MM MMMMMMII MMMM 

CTCACGCTCCAATGCCAGGGTGGCA TTCTCAGGGC 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Query 599 
Sbjct 560 
Query 658 



-GGACTGCCTCTGGCTCAGAGGTTGCTGGATGTCATCTCTGTAT 
GTGATTGCCCCAGAGGCCGTTTCTTGGCCGCCATCTGCCAAGACTGTGGCCGCAGGAAGC 

MM II II MMMM III II I MM II M M M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GTGACTGTCCTAGAGGCCGATTCCTGACTGCCACCTGCCAAGACTGTGGCCGCAGGAAGC 
TGCCCGTGGACCGCATCGTGGGAGGCCGGGACACCAG 



MM lllllllllll MM 



679 TGCCGGTGGACCGCATTGTGGGGGGCCAGGACAGCAGTCT-GGGAAGGTGGCCGTGGCAG 737



MMMM II MMMM 



Query 896 



Mill Mill II MM 



MM MMI Ml II III MMMMIMM 



I MMMMMMII MMMM II MMMM 



I II MMMMIMM II MMMM 



GTTTGCCGGTGCCGTGGCCCAGG-CCTCTCCCCACGGTCTGCAGCTGGGGGTGCAGGCTG 954



I MM Mill I I MM MMMM lllllll 



ATTTGCTGGTGCTGTAGCCC-GGACCTCACCCCATGCTGTGCAACTGGGGGTTCAGGCTG 915 
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Sbjct 

Sbjct 
Sbjct 
Sbjct 
Sbjct 
Sbjct 



Sbjct 



,11 MINIM lllllllllll I 



Query 1014 GATATTGCCCTGGTCCACCTCTCCAG-T V 'C AGAATACATCCAGCCTGT 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 :: : 

Sbjct 975 GACATTGCCTTGGTCCACCTCTCTAGCTCCC-TGCCTCTCACAGA 

Query 1073 GTGCCTCCCAGCTGCCGGCCAGGCCCTGGTGGATGGCAAGATCTGTACCGTGACGGGCTG 



1760 
1741 



I I I I I I I I I I II II 



1034 GTGTCTCCCTGCTGCGGGACAGGCCCTGGTGGATGGCAAGGTCTGTACTGTGACCGGCTG 



III Mill MM MMMMMIIMI . 



193 AATCAGCAATGATGTCTGCAATGGCGCTGACTTCTATGGAAACCAGATCAAGCCCAAGAT 



II Mill II II Mill II I Mill 



CATAAGCAACGAAGTTTGCAACAGCCCCGACTTCTACGGGAATCAGATCAAGCCCAAGAT 
GTTCTGTGCTGGCTACCCCGAGGGTGGCATTGATGCCTGCCAGGGCGACAGCGGTGGTCC 



CTTTGTGTGTGAGGACAGCATCTCTCGGACGCCAC 
I I I I I I I I I I I I I I I I I I I I I I I I MM 
CTTTGTGTGTGAAGACAGCATCTCTGGGA' 



Mill II lllllll 



I I I I I I I I I I 



1334 CTGGGGTACGGGCTGTGCTTTGGCCCGGAAGCCAGGAGTGTACACCAAAGTCACTGACTT
1433 

13 94 CCGGGAGTGGATCTTCAAGGCCATAAAGACT^ 



I I I I I I I I I I I I I I I 



MM MMMI Mill 



MMI MMI II II Mill 



M 1 1 IIIM 



4gccacmgcgacagtggaggccc 



.CGTTGGCGGCTGTGTGGCATTGTGAG 

. MMMM lllllllllll II 

-1 - T ,-T-TGTGGCATTGTAAG 



M MINIM II II II 



GCTCTGA-CCGG--TGG-CT---T-CTC-G--CTGCGC-AGCCTCCAGGGCCCGAG--G-
GCCCTGATCCCGCCTCATCTCGCTGCTC 



T- -GAT- -C-CC-G-- 



- - GTGGTGGGATCCACGCTGGGCCG - AGGATGGGACGTTTTT 



TCTGGTGGCTCCAGCCCCACGTGGTAG^ 
CTTCTTGGGCCCGGTCCACAGGTCCAAGGACACCCTCCCTCCAGGGTCCTCTCTTCCACA 
II II I II llllll NNIIINI I I INI II I lllllllllll 
CTGCTCAGATCCAGTCCACGGGTCCAAGGATGC--TGGATCCAAGGACTTCTCTTCCACA

GTGGCGGGCCCACTCAGCCCC-GAGACCACCCAACCTCACCCTCCTGACCCCCATGTAAA 

INN NNIIINI III I I III lllllllllll INN lllllll 

GTGGCCGGCCCACTCAATCCCAGGG-CCATTGG-CCTCACCCTCCC-ACCCC-ATGTAAA 
TATTGTTCTG - 



lllllll 
CGCTCTT 



T A AAT A AT AA AGAT GGT T r 



... _ 3-ATT 1783 

MliiliiMmLiii 1764 



>ref|NM_017112.1| Rattus norvegicus hepsin (Hpn), mRNA
emb|X70900.1|RNHEPA R.norvegicus mRNA for hepsin

Length=1739

GENE ID: 29135 Hpn | hepsin [Rattus norvegicus]

Score = 1297 bits (702)
Identities = 1341/1643, Gaps = 69/1643
Strand=Plus/Plus

Query 184 ACCCCAGGGTCCCACCCTGGCCCAGGAGGTCAGCCAGGGAATCATTAACAAGAGGCAGTG 243



NNIIINI II III INN 

123 ACCCCAGGGTTCCGCCCCAGCCCAACAGGTCAACCTGGGAATCATTAACAAGAGTCCCTG

244 ACATGGCGCAGAAGGAGGGTGGCCGGACTGTGCCATGCTGCTCCAGACCCAAGGTGGCAG 303

INI I II NNNNNNINNIN llllll II 

183 ACAT-G-GC-GAAGGAGGGTGGCCGGACTGCACCATGCTGTTCCAGACCCAAGGTGGCAG 239

304 CTCTCACTGCGGGGACCCTGCTACTTC-TGACAGCCATCGGGGCGGCATCCTGGGCCATT 362



Sbjct 



Sbjct 358 -GGACTCTCGACTTTTGGTGTTGGACAAGACAGA-GGGAACGTGGAGGCTGCTGTGCTCC 415

Query 480 TCGCGCTCCAACGCCAGGGTAGCCGGACTCAGCTGCGAGGAGATGGGCTTCCTCAGGGCA 539

II NNNNNNINNIN II III INI 1 1 1 1 1 1 1 1 1 1 II 1 1 llllllll 

Sbjct 416 TCACGCTCCAACGCCAGGGTAGCAGGGCTCGGCTGTGAGGAGATGGGCTTTCTCAGGGCT 475 

Query 540 CTGACCCACTCCGAGCTGGACGTGCGAACGGCGGGCGCCAATGGCACGTCGGGCTTCTTC 599 



: PubMed links) 
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TGCGTGGACGAGGGC-GGTCTGCCT 

TGATTGCCCCAGAGGCCGTTT 
II II II llllllll II 



TGATTGCCCCAGAGGCCGTTTCTTGGCCGCCATCTGCCAAGACTGTGGCCGCAGGAAGCT 



II I 1 1 1 1 MINIMI 

595 CGACTGTCCTAGAGGCCGATTCCTGACTGCCACCTGCCAAGACTGTGGCCGCAGGAAGCT 654
GCCCGTGGACCGCATCGTGGGAGGCCGGGACACCAGCTTGGGCCGGTGGCCGTGGCAAGT 778



Query 71'. 

Sbjct 655 GCCGGTGGATCGCATTGTGGGGGGCCAGGACA

Sbjct 715 CAGCCTGCGTTATGATGGGACC- 

Query 838 



Sbjct 
Sbjct 
Sbjct 



Sbjct 
Sbjct 
Sbjct 
Sbjct 
Sbjct 



TTGCCGGTGCCGTGGCCCAGG 



.. . I lllllll 

TACTGACCGCTGCACACTGCTTTCCAGAGAG3-- T T ' T 1 3AGTAT 83 3 



CCTCTCCCCACGGTC - TGCAGCTGGGGGTGCAGGCTGT 



I III 

TTGCTGGTGCTGTAGCCC-GGACCTCACCTCATGC-CGTGCAGCTGGGGGTTCAGGCTGT 

GGTCTACCACGGGGGCTATCTTCCCTTTCGGGACCCCAAC-AGCGAGGAGAACAGCAACG 
' ' ' " II I I 



GATCTATCATGGGGGCTACCTTCCCTTTCGAGACCCTA-CTATCGACGAAAACAGCAATG 

ATATTGCCCTGGTCCACCTCTCCAG-TCCCCTGCCCCTCACAGAATACATCCAGCCTGTG 
I llllllllllllllllllll II I I I I I I I I IIIIIIIIIIIIMIIIIII II 
ACATTGCCCTGGTCCACCTCTCTAGCTCCC-TGCCTCTCACAGAATACATCCAGCCGGTT 

TGCCTCCCAGCTGCCGGCCAGGCCCTGGTGGATGGCAAGATCTGTACCGTGACGGGCTGG 

TGTCTCCCTGCTGCGGGACAGGCCCTGGT 

GGCAACACGCAGTACTATGGCCAACAGGCCGGGGTACTCCAGGAGGCTCGAGTCCCCATA 
GGTAACACACAGTTCTATGGCCAGCAAGCTCT 



1130 ATAAGCAACGAAGTTTGCAACAGCCCCGACTTCTACGGGAATCAGATCAAACCCAAGATG
1254 TTCTGTGCTGGCTACCCCGAGGGTGGCATTGATGCCTGCCAGGGCGACAGCGGTGGTCC-
llllllllllllll II llllllll llllllll llllllll llllllll II II 

1190 : ;agg-cca 



1372 GTTGGG " 1 7 ~*TGGCCCAGA, 



:ggggtacgggctgtg< 
gggagtggatcttcca — 

I I I I I I I I I I I I I I I I I I 



, I I I I I I I I I I I I I I 

1CTTTGGCCCGGAAGCCGGGAGTGTACACCAAAGTCATTGACT 

TCCGGGAGTGGATCTTCCAGGCCATAAAGACTCACTCCGAAGCCAGCGGCATGGTGACCC 

1 1 1 1 1 1 1 1 1 M 1 1 I I I I I 



:ggctctgcggcattgtaa 

lCACCAAAGTCAGTGACT 



tccgggagtggatcttccaggccataaagactcactccgaagctaccggcatggtaactc 
agcctccagggcccgag- -G- 

Ill Mill I I III I 

AGCCCTGACCCCGCCTCATCGCCTGCTCCGCGCTGCTCCAGCATCCAGAGTCAGAGTTGG 
T- -GAT- -C-CCGGT-G- - - GTGGGA - - -TCCACGCTGGGCCG-AGI 
TCTGGTGGCTCCAGCCGCACGTGGCAGGCTCCACACTcLUcCTcic 



CTTCTTGGGCCCGGTCCACAGGTCCAAGGACACCCTCCCTCCAGGGTCCTCTCTTCCACA 



II II II II MM. 



GATCCAAGGATGC - -TGGGTCCAAGGJ 



-GAGAO:AO:CAAO:TCAO:CTO:TGACO:CCATGTAAA 



Query 1704 TATTGTTCTG - CTGTCTGGGA - CTCCTGT ^3-TGCTCTTT 

MM MM I llllll II II II III MMM II II MIMIIMM 

Sbjct 1661 TATTACTCTGTCC-TCTGGGGGCTGCTTTCGAGGCGCCCCT--TG-TGCGGATGCTCTTT



I I I I I I I I I 
AAGTCATTGACT 

GCATGGTGACCC 
lllllll II I 



AGGATGGGACG 



- ... iCGTTTTT 

- ATGGAACGGTTTT 



AAATAATAAAGATGGTTTTGATT 



>gb|AF030065.1|AF030065 Mus musculus serine protease hepsin mRNA, complete cds
Length=1781 

GENE ID: 15451 Hpn | hepsin [Mus musculus] (Over 10 PubMed links) 

Score = 1280 bits (693), Expect = 0.0 
Identities = 1341/1646 (81%), Gaps = 75/1646 (4%)
Strand=Plus/Plus 

Query 184 ACCCCAGGGTCCCACCCTGGCCCAGGAGGTCAGCCAGGGAATCATTAACAAGAGGCAGTG 243
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Sbjct 



ACATGGCGCAGAAGGAGGGTGGCCGGACTG-T3::i.T 1 T - , : 3 3AAGGTGGCA 
I I I I I II I I I I I I I I I I I I I I I I I I I I I llllllllllllllllllllllllll 
ACAT-G-GC-GAAGGAGGGTGGCCGGACTGCAG-CATGCTGCTCCAGACCCAAGGTGGCA



II III MINIM 



III MMM Ml Mill II MMIMIMI 



TGTGGCTGT1 4 19 

MM I I II II III M MMMI MMMI MM II 

TGTGACCATCCTACTGCAG-AGTGACCAGGAGCCACTGTACCAAGTGCAGCTCAG-TCCA 375

GCGGACGCTCGGCT--CATGGTCTTTGACAAGACGGAAGGG-ACGTGGCGGCTGCTGTGC 476



T 1 ' 



TCCTCGCGCTCCAACGCCAGGGTAGC-G 3 I'- TG3GAGGA3 i — ,G3TTCCTCAGG 

Ill llllllll II II III MM 1 1 1 1 1 1 1 1 1 1 1 1 1 I MMM 



TTCTGTGTGGACGAGGGGAGG-CTGCCCCAC.-" I I GGTCATCTCCGT 655 

II II MMIMIMI II Mill I I MMM MMMI llllllll II 

TTTTGCGTGGACGAGGGC-GGACTGCCTCTGGCTCAGAGGTTGCTGGATGTCATCTCTGT 611 

GTGTGATTGCCCCAGAGGCCGTTTCTTGGCCGCCATCTGCCAAGACTGTGGCCGCAGGAA 715 



Mill II 



ATGTGACTGTCCTAGAGGCCGATTCCTGACTGCCACCTGCCAAGACTGTGGCCGCAGGAA 671 
GCTGCCCGTGGACCGCATCGTGGGAGGCCGGGACACCAG-CTTGGGCCGGTGGCCGTGGC 774

MMM MMIMIMI Mill MM MMI Ml II III I 

gctgccggtggaccgcattgtggggggccaggacagcagtct-g " 

aagtcagccttcgctatgatgg-a 
'ggIcagcc! 

tgggtgctgacagccgcccactgcttcccggagcggaaccgggtcctgtcccgatggcga 
MMIMIMI II II II Mill II 1 1 1 II 1 1 M II II I 1 1 1 1 1 1 II MMM 

TGGGTGCTGACTG 1 TTGGTTT rGGTGTGTGGGTGGGG 

GTGTTTGCCGGTGCCGTGGCCCAGG-CCTCTCCCCACGGTCTGCAGCTGGGGGTGCAGGC 



iTTTG 



TGTGGT3TA33A3GGGGG3TAT3TT30: 



TGGGGTAACAC 



\CTTGCA - G - TGTTGGACAAGACGGA -GGGTACGTGGAGGCTACTGTG 



Jl llllllll Mill II MMIMIMI Mill IMIMIII 



MMMI III II I MM MMMMMMMMIMIMM 



MM MM 

m GTGATCTA 

' C ?ftH?l 
.TGACATTGC 

GTGTGCCTCCCAGCTGCCGGCCAGGCCCTGGTGGATGGCAAGATCTGTACCGTGACGGGC 

Mill Mill Mill II MMMMMMMMIMM MMMI Mill III 

GTGTGTCTCCCTGCTGCGGGACAGGCCCTGGTGGATGGCAAGGTCTGTACTGTGACCGGC 



909 TGTGATCTATCATGGGGGCTACCTTCCCTTTCGAGACCCTA-CTATTGACGAAAACAGCA 967

1070 

ATGACATTGCCTTGGTCCACCTCTCTAGCTCCC-TGCCT 1026

1130 



:agttctatggccaacaggctatggtgct 



ATAATCAGCAATGATGTCTGCAATGGCGCTGACTTCTATGGAAACCAGATCAAGCCCAAG 

::::: n n mil n i imiimi m ii i i i i i i i i i i i i i i i 

ATCATAAGCAACGAAGTTTGCAACAGCCCCGACTTCTACGGGAATCAGATCAAGCCCAAG 
ATGTTCTGTGCTGGCTACCCCGAGGGTGGCATTGATGCCTGCCAGGGCGACAGCGGTGGT 

: : : : 1 1 1 n 1 1 1 1 1 1 1 1 1 1 1 1 1 1 n i mmimmimm ii ii 

ATGTTCTGTGCTGGCTATCCTGAGGGTG , Tl T TGCCAGGGCGACAGTGGAGGC 



MMM IMIMIII II M MM 



TTCCGGGAGTGGATCTTCCAGGCCATAAAGACTCACTCCGAAGCCAGCGGCATGGTGACC 

MMMI MMMMMMMMMMIMIMM MMIMIMI 

TTCCGGGAGTGGATCTTCAAGGCCATAAAGACTCACTCCGAAGCCAGTGGCATGGTGACT 
CAGCTCTGA 

MM MM 

CAGCCCTGATCCCGCCTCATCTCGCTGCTCCGT 3 -. 3_-aGTCAGAGTT 



.Mill II Mill MMMI I MM I 



1446 
1536 
1506 



I II II I II MMM MIIIIMM I I MM II I IMIMIII 
CAGTGGCGGGCCCACTCAGCCCC-GAGACCACCCAACCTCACCCTCCTGACCCCCATGTA

lllllll llllllllll III I I III MINIMI HIM HIM 

CAGTGGCCGGCCCACTCAATCCCAGGG-CCATTGG-CCTCACCCTCCC-ACCCC-ATGTA 



TTTAAATAATAAAGATGGTTTTGATT 
TTTAAATAATAAAGGTGGTTTTGATT 



1 1 MMI MMM II II 



1759 



>ref|NM_001110252.1| Mus musculus hepsin (Hpn), transcript variant

GENE ID: 15451 Hpn | hepsin [Mus musculus] (Over 10 PubMed links)

Score = 1245 bits (674), Expect = 0.0
Identities = 1282/1569 (81%), Gaps = 68/1569 (4%)
Strand=Plus/Plus



Sbjct 
Sbjct 
Sbjct 
Sbjct 



AGGGTGGCCGGACTG-TGCCATGCTGCTCCAGACCCAAGGTGGCAGCTCTCACTGCGGGG 317 



118 ACCCTGCTACTTC-TGACAGCCATCGGGGC 3 i T -TT3TGGCTGTTCTCCT 376 



3 77 -CAGGAGTGACCAGGAGCCGCTGTACCC- ,1 - r T -GCGGACGCTCGGCTC 

III MMMIMIMM lllllll lllllll MM II I MM I Mill 

398 GCAG-AGTGACCAGGAGCCACTGTACCAAGTGCAGCTCAG-TCCAGGGGACTCACGGCTT

435 ATGGTCTTTGACAAGACGGAAGGGA-CGTGGCGGCTGCTGTGCTCCTCGCGCTCCAACGC
,56 GCGgUIIgAC^ 

494 CAGGGTAGCCGGACTCAGCTGCGAGGAGATGGGCTTCCTCAGGGCACTGACCCACTCCGA

MMM II II III MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMM III I Mill II 

515 CAGGGTGGCAGGGCTCGGCTGTGAGGAGATGGGCTTTCTCAGGGCTCTGGCGCACTCGGA

554 GCTGGACGTGCGAACGGCGGGCGCCAATGGCACGTCGGGCTTCTTCTGTGTGGACGAGGG

MMM Mill II MMIMIMI Mill MMMMIM II MMIMIMI 

375 G':tggatgtG':G':A':tG':ggG':G':':aA':gG':A':aT':ggG':tT':ttttG':gtgga>:gaggg 



Query 673 



694 



Sbjct 

Query 733 

Sbjct 754 

Query 792

Sbjct 813 

Query 851 

Sbjct 872 

Query 911 

Sbjct 932 



GCCGATTCCTGACTGCCACCT 

gatgg-agcacacctctgtgggggatccctgctctccggggactgggtgctgacagccgc 

GaIgGGACC UiyyGlGGGGGGliiilGilGliTGGGGiilGGGlGilGiiTGiTGi 
CCACTGCTTCCCGGAGCGGAACCGGGTCCTGTCCCGATGGCGAGTGTTTGCCGGTGCCGT 

II Mill II I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II MMMM Mill Mill II 

ACATTGCTTTCCAGAGCGGAACCGGGTCCTGTCTCGGTGGCGAGTATTTGCTGGTGCTGT 



Query 970 GCTATCTTCCCTTTCGGGACCCCAAC-A 

Sb.ct 991 Gc!aCc!!cCc!!!oGAG^ 

Query 102 9 CACCTCTCCAG-TCCCCTGCCCCTCACAGAATACM 

Sbjct 1050 

Query 1088 CGGCCAGGCCCTGGTGGATGGCAAGATCTGTACCGTGACGGGCTGGGGCAACACGCAGTA

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 lllllll Mill MMMM Mill MM

Sbjct 1109 GGGACAGGCCCTGGTGGATGGCAAGGTCTGTACTGTGACCGGCTGGGGTAACACACAGTT

Query 1148 CTATGGCCAACAGGCCGGGGTACTCCAGGAGGCTCGAGTCCCCATAATCAGCAATGATGT
I I I I I I I I I I I I I I I III Mill Mill II II Mill II Mill II II
Sbjct 1169 CTATGGCCAACAGGCTATGGTGCTCCAAGAGGCCCGGGTTCCCATCATAAGCAACGAAGT

Query 1208 CTGCAATGGCGCTGACTTCTATGGAAACCAGATCAAGCCCAAGATGTTCTGTGCTGGCTA

Sbjct 1229 TTGCAACAGCCCCGACTTCTACGGGAATCAGATCAAGCCC 



TCCTGAGGGTGGCATTGATGCGT 

iGTTGGGGCACTGGCTG 

llllllllll MM II I MMMM MMIMIMI II Mill II Mill 
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TGCTTTGGCCCGGAAGCCAGGAGTGT 

CCAGGCCATAAAGACTCACTCCGAAGCCAGCGGCATGGTGACCCAGCTCTGA-CCGG--T

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 'MINIM MM MM M I I 

CAAGGCCATAAAGACTCACTCCGAAGCCAGTGGCATGGTGACTCAGCCCTGATCCCGCCT 



■T-CTC-G--CTGCGC-AGCCTCCAGGGCCCGAG- 



GTGGTGGGATCC ACG i l rTTTT TTCTTGGGCCCGGT 

MMI M MMI MMMI I MM I I Ml M M I M M 

CCCACGTGGTAGGCTCCACACTGGGCCTCJ-.C-.-_- .AGATCCAGT 



CCGTGCT^" 

GATCCAC 
I MMI 



Sbjct 
Sbjct 

QUSry MM I MM MM 

Sbjct 1648 CCACGGGTCCAAGGATGC--TGGATCCAAGGACTTCTCTTCCACAGTGGCCGGCCCACTC
Query 1660 AGCCCC-GAGACCACCCAACCTCACCCTCCTGACCCCCATGTAAATATTGTTCTG-CTGT

I III I I III MIMIIMM Mill MM I MM I I

Sbjct 1706 AATCCCAGGG-CCATTGG-CCTCACCCTCCC-ACCCC-ATGTAAATATTACTCTGTCC-T

Query 1718 CTGGGACTC-CTGTCTAGGT-GCCCCTGATGATGG-GATGCTCTTTAAATAATAAAGATG
Mill I II Mill llllll II II IIIIIIIIIIIIMMIMM II
Sbjct 1761 CTGGGGGGCGCT--CTAGGGAGCCCCT--TG-TGCAGATGCTCTTTAAATAATAAAGGTG



Sbjct 



GTTTTGATT 
MIMIIII 
GTTTTGATT 



>gb|BC138809.1| Mus musculus hepsin

complete cds 

Length=1505 

Score = 1240 bits (671), Expect = 
Identities = 1075/1271 (84%), Gaps
Strand=Plus/Plus 



mRNA (cDNA clone MGC.l 



Sbjct 395 

Query 554 

Sbjct 455 

Sbjct 515 

Query 673 

Sbjct 574 

Sbjct 634 

Query 851 



?GGATGTGCGCACTGCGGGCGCCM 

GA ??"?i??? c r cA ? c< [t?t?? c i??i??t G ?i?tT?i? c ?n?i?t T i? c ?? c t?t? 

C -GGACTGCCTCTGGCTCAGAGGTT- .TGTGACTGTCCTAGAG 



I III II I MM llllllllllllllllllllllll 



GCCGATTCCTGACTGCCACCTGCCAAGACTGTGGCCGCAGGAAGCTGCCGGTGGACCGCA 633

3 - CTTGGGCCGGTGGCCGTGGCAAGTCAGCCTTCGCTAT 7 91 
Mill || Ml 



MMMIMIMM MM || Mllll 



CCACTGCTTCCCGGAGCGGAACCGGGTCCTGTCCCGATGGCGAGTGTTTGCCGGTGCCGT 

II Mill II I II llllll 

ACATTGCTTTCCAGAGCGGAACCGGGTCCTGTCTCGGTGGCGAGTA"! 



II II MM Mill I I MM MM MM Mill 



AGCCC-GGACCTCACCCCATGCTGTG7.---.77:- ■ .-.-.TCTATCATGGGG 8 



.MIIIIMM Mill I I I III II Mill 



llllllll II MM MM MMMMMMMMIMI Mill Mill Mill 



AGGGTGGCCGGACTG-TGCCATGCTGCTCCAGACCCAAGGTGGCAGCTCTCACTGCGGGG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II I I M 1 1 1 1 1 1 1 1 1 1 1 II III 

AGGGTGGCCGGACTGCAG-CATGCTGCTCCAGACCCAAGGTGGCAGCTCTCATTGTGGGT 
ACCCTGCTACTTC-TGACAGCCATCGGGGCGGCATCCTGGGCCATTGTGGCTGTTCTCCT 
ACCCTGCTG-TTCCTGACAGGCATTG^ 



2 78 TCACGGCTT 33 5 



GCGGTGTTTGACAAGACGGA - C 
CAGGGTAGCCGGACTCAGCTGCGAGGAGATGGGCTTCCTCAGGGCACTGACCCACTCCGA 
MMM II II III MM I I I I I I I I I I I I I I llllllll III I Mill II 
CAGGGTGGCAGGGCTCGGCTGTGAGGAGATGGGCTTTCTCAGGGCTCTGGCGCACTCGGA 



Mill MMIIIMI 



IN. Mil II II 
CGGCCAGGCCCTGGTGGATGGCAAGATCTGTACCGTGACGGGCTGGGGCAACACGCAGTA
II I I I I I I I I I I I I I I I I I I I I I lllllll Mil MINI Mill I I I I 

GGGACAGGCCCTGGTGGATGGCAAGGTCTGTACTGTGACCGGCTGGGGTAACACACAGTT 



1349 
1507 
1409 



I III IN MINI II I III I J 1,1.1 III 



TTGCAACAGCCCCGACTTCTACGGGAA^ 

CCCCGAGGGTGGCATTGATGCCTGCCAGGGCGACAGCGGTGGTCCCTTTGTGTGTGAGGA 
II I I I I I I I I I I I I I I I I I I'M II II I I I I I I I I I I I I I I II 

TCCTGAGGGTGGCATTGATGCGTGCCAGGGCGACAGTGGAGGCCCCTTTGTGTGTGAAGA 



148 CCAGGCCATAAAGACTCACTCCGAAGC-- , , 11 1 - GCTCTGA - CCGGTGG 



CTTCTCGCTGC 1517 



1419 



MNMMM 



pleen cDNA, RIKEN full-length enriched 



) PubMed links) 



>dbj|AK156553.1|

Length=1745

GENE ID: 15451 Hpn | hepsin [Mus musculus] (Over 

Score = 1240 bits (671), Expect = 0.0 
Identities = 1282/1570 (81%), Gaps = 70/1570 (4%) 
Strand=Plus/Plus 

Query 259 AGGGTGGCCGGACTG-TGCCATGCTGCTCCAGACCCAAGGTGGCAGCTCTCACTGCGGGG

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II III 

Sbjct 195 AGGGTGGCCGGACTGCAG-CATGCTGCTCCAGACCCAAGGTGGCAGCTCTCATTGTGGGT 

TGC r 

-AG'. 

<jTC' 

I I _ r 



ACCCTGCTACTTC-TGACAGCCATCGGGGCGGCATCCTGGGCCATTGTGGCTGTTCTCCT 

Ill MINI III Mill II lllllll' | | || || 

TTCCTGAOG, 1 T ^ 1 1 ^-TCCTACT 



caggagtgA':':aggaG':':G':tgtA':':':agtG':agg-T':aG':T':t-': 

mi ::::iii iiiini inii n mm ii i mm i iiiii 

GCAG-AGTGA<"- TGI _ T 1 [ TCCAGGGGACTCACGGCT 



Query 493

Sbjct 429 

Query 553 

Sbjct 489 

Query 613 

Sbjct 549

Query 672 

Sbjct 608 

Query 732 
Sbjct 

Sbjct 



TGCGGTGTTTGACAAGACGGA - 
.CTCAGCTG 

III Mil I 



CCAGGGTAGCCGGACTCAGCTGCGAGGAGATGGGCTTCCTCAGGGCACTGACCCACTCCG 552 



I I I I I I I I I I I I I I I I I I I I I I I 



I 



M IIIII Mill II MIIIIMM 



56 8 



MIMI Mill II Mill II II Ml 

GC-GGACTGCCTCTGGCTCAGAGGTTGCTGGATGTCATCTCTGTATGTGACTGTCCTAGA 607

GGCCGTTTCTTGGCCGCCATCTGCCAAGACTGTGGCCGCAGGAAGCTGCCCGTGGACCGC 731 
IIIII III II I MM IIIIIIIIIIIMMIIIIIIIMMIIII MIMIIII 

GGCCGATTCCTGACTGCCACCTGCCAAGACTGTGGCCGCAGGAAGCTGCCGGTGGACCGC 667

ATCGTGGGAGGCCGGGACACCAG-CTTGGGCCGGTGGCCGTGGCAAGTCAGCCTTCGCTA 790 

.GCAGTCT-GGGAAGGTGGCCGTGGCAGGTCAGC 726 



MIMI I I MMMMIIMM Mill II MIMIMMMIMM II I 



Query 910 TGGCCCAGG-CCTCTCCCCACGGTCTGC 1 T ;t ^GTGTACCACGGG 968 

I MM II MM IIIII I I MM MIMI MIMIIII MM II Ml 

Sbjct 846 TAGCCC-GGACCTCACCCCATGCTGTGCAACTGGGGGTTCAGGCTGTGATCTATCATGGG 904

Query 969 GGCTATCTTCCCTTTCGGGACCCCAAC-AGCGAGGAGAACAGCAACGATATTGCCCTGGT 1027

Sbjct 905 mMm^^l-WT^cMm^m^ 963 

Query 1028 CCACCTCTCCAG-TCCCCTGCCCCTCACAGAATACATCCAGCCTGTGTGCCTCCCAGCTG 1086 
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Sbjct 



Sbjct 



Sbjct 
Sbjct 
Sbjct 
Sbjct 



CCGGCCAGGCCCTGGTGGATGGCAAGATCTGTACCGTGACGGGCTGGGGCAACACGCAGT 



1207 
1143 
1267 



Illllllllllll 

TTTGCAACAGCCCCGACTTCTACGGGAATCAGATCAAGCCCAAGATGTTCTGTGCTGGCT 



ACCCCGAGGGTGGCATTGATGCCTG 7 :- :- 3 



XTGAGGGTGGCA: 



1443 
1546 



MINIMI III 

-TGCCTCTCACAGAATACATCCAGCCAGTGTGTCTCCCTGCTG 



GTGGTGGGATCCACGCTGGGCCG - 



1503 CCCCACGTGGTAGGCTCCACACTGGGCCTCAC - 

1599 . . 

mm :::::::::: : : :::: : ::::::::::::: m mmmm 

1562 TCCACGGGTCCAAGGATGC--TGGATCCAAGGACTTCTCTTCCACAGTGGCCGGCCCACT
1659 CAGCCCC-GAGACCACCCAACCTCACCCTCCTGACCCCCATGTAAATATTGTTCTG-CTG

II MM I 

:-ATGTAAATATTACTCTGTCC - 
TCTGGGACTC-CTGTCTAGGT-GCCCCTGATGATGG-GATGCTCTTTAAATAATAAAGAT



327 A'7AG':aT':tcT':ggA':G':':A':gttgG':gG':tgtgtgG':attgtgagttgggG':A':tgG':t 

MIMIMMI MM II I MMMM MMIMIMI II Mill II MM 

263 ACAGCATCTCTGGGACATCAAGGTGGCGG'7T"-~ 3T - ;7ATTGTAAGCTGGGGTACGGGCT 
3 87 GTGCCCTGGCCCAGAAGCCAGGCGTCTACACCAAAGTCAGTGACTTCCGGGAGTGGATCT 

MM MUM lllllllll II .MMMM II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

32 3 GTGCTTTGGCCCGGAAGCCAGGAGTGTACACCAAAGTCACTGACTTCCGGGAGTGGATCT 
447 TCCAGGCCATAAAGACTCACTCCGAAGCCAGCGGCATGGTGACCCAGCTCTGA-CCGG- - 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MIMIMMI MM MM II I 

383 TCAAGGCCATAAAGACTCACTCCGAAGC'7A:-TG 3' IATGGTGACTCAGCCCTGATCCCGCC 



TCATCTCGCTGCTCCGTGCTGCACTAGCATCCAGAGTCAGAGTTGGTCTGGTGGCTCCAG 



,(7AGT(MA(MCCCCTTTGTGTGTGAAG 



>dbj|AK002694.1| Mus musculus adult male kidney cDNA, RIKEN full-length enriched
library, clone:0610030A17 product:hepsin, full insert sequence

Length=1814 



GENE ID: 15451 Hpn | hepsin [Mus musculus] (Over 10 PubMed links)

ect =0.0 
Gaps = 80/1573 (5%) 

AGGGTGGCCGGACTG-TGCCATGCTGCTCCAGACCCAAGGTGGCAGCTCTCACTGCGGGG 317 
AGGGlGGCOGG^^ 332 



392 



628 



ACCCTGCT-GTTCCTGACAGGCATTGGGGCCGCGTCCTGGGCCATTGTGACCATCCTACT

-CAGGAGTGACCAGGAGCCGCTGTACCCAGTGCAGGTCAGCT-CTGCGGACGCTCGGCTC

II I I I I I I I I I I I I I I I lllllll lllllll MM I I I MM I Mill 
GCA - GAGTGACCAGGAGCCACTGTA 7 " 7AGGGGACTCACGGCTT 



Sbjct 450 

Sbjct 509 

Query 554 

Sbjct 569 

Query 614 



GCGGTGTTTGACAAGACGG^ 

CAGGGTAGCCGGACTCAGCTGCGAGGAGATGGGCTTCCTCAGGGCACTGACCCACTCCGA 
CAGGGTGGCAGGGCTCGGCTGTGAGGAGATGGGCTTT 



GAGG-C 

I II Mill I I llllll lllllll MMMM II Mill II II MM 

GCGGACTGCCTCTGGCTCAGAGGTTGCTGGATGTCATCTCTGTATGTGACTGTCCTAGAG 



MM III II I MM 'MMMM lllllllll MIIIIMM 
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Sbjct 



TCGTGGGAGGCCGGGACACCAGCT-T33j 1 3:GTTCGCTAT 

I Mill I I I I Mill III I MM II I I I I I I I I I I I llllllll II III 
TTGTGGGGGGCCAGGACAGCAG-TCTGGGAAGGTGGCCGTGGCAGGTCAGCCTGCGTTAT 



Query 850 



1264 
1276 

1336 
1384 
1396 
1444 
1456 
1503 
1516 
1544 
1576 
1596 
1635 
1656 
1693 
1714 



Mill I I MMMMIIMM I llllll II I MM 1 II I 



CACATTGCTTTCCAGAGCGGGAGA-C-GCT 

CGTGGCCCAGG-CCTCTCCCCACGGTCTGCAGCTGGGGGTGCAGGCTGTGGTCTACCACG 

II MM II MM Mill I I MM II Mill IMIIIMI MM II I 

TGTAGCCC - GGACCTCACCCCATGCTG^ 3CAAC1 ATCATG 



CTGCGGGACAGGCCCTGGTGGATGGCAAGGTC 

AGTACTATGGCCAACAGGCCGGGGTACTCCAGGAGGCTCGAGTCCCCATAATCAGCAATG 
III MMMMMIMM III Mill Mill M II Mill II Mill I 
AGTTCTATGGCCAACAGGCTATGGTGCTCCAAG.- i i ' JGGTT ' - T -ATAAGCAACG 



Query 1204 



GCTACCCCGAGGGTGGCATTGATGCCT i ' IGGGG 'AG ' GGTGGTCCCTTTGTGTGTG 
GCTATCCTGAGGGTGGCATTGATGCGTGCCAGGGCGA 

GCTGTGCCCTGGCCCAGAAGCCAGGCGTCTACACCAAAGTCAGTGACTTCCGGGAGTGGA 



TGTTGGAGGGGATAAA_GAGTGAGTGGGAA_GGGAGGGGGATGGTGAGGGAGGTGTGA - GGG 

Mill II II II II II I II II II II II II II II I lllllllli MM MM II 

TCTTCAAGGCCATAAAGACTCACTCCGAAGCCAGTGGCATGGTGACTCAGCCCTGATCCC 
G- -TGG-CT T-CTC-G- - CTGCGC - AGCCTCCAGGGCCCGAG - - G - T - - GAT - - C - C 

II II I III I MM I III Mill I I III II II II 

GCCTCATCTCGCTGCTCCGTGCTGCACTAGCATCCAGAGTCAGAGTTGGTCTGGTGGCTC 

C -G GTGGTGGGATCCACGCTGGGCCG - AGGATGGGACGTTTTTCTT CTTGGGCC 

CAGCCCCACGTGGTAGGCTCCACACTGG^ 

CGGTCCACAGGTCCAAGGACACCCTCCCTCCAGGGTCCTCTCTTCCACAGTGGCGGGCCC 
CAGTCCACGGGTCCAAGGATGC - -TGGATCCAGGGACTTCTCTTCCACA 



4 



ACTCAGCCCC-GAGACCACCCAACCTCACCCTCCTGACCCCCATGTAAATATTGTTCTG-
II II I III I I III MIMMIMI Mill MIMIIMM MM 

ACTCAATCCCAGGG - CCATTGG- CCTCACCCTCCC - ACCCC - ATGTAAATATTACTCTC 

CTGTCTGGGACTC-CTGTCTAGGT-GCCCCTGATGATGG-GATGCTCTTTAAATAATAAA
I llllll I II Mill llllll II II IN 
CC - TCTGGGGGGCGCT - - CTAGGGAGCCCCT - - 



Query 1771 GATGGTTTTGATT 



>ref|XM_512584.2| PREDICTED: Pan troglodytes hepsin (transmembrane protease, serine 1), transcript variant 3 (HPN), mRNA

Length=1572



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MINIMUM 



TGCCCGTGGACCGCATCGTGGGAGGCCGGGACACCAGCTTGGGCCGGTGGCCGTGGCAAG 



Query 778 
Sbjct 610 
Query 838 



TGCTGACAGCCGCCCACT :- :TTCCC \-_7 :- :- - 3AGTGT 8 9 7 

III MM MM II 

TGCTGACAGCCGCCCACTGCTTCC—" 



[■TCCCGGAGCGGAACCGGGTCCTGTCCCGATGGCGAGTGT 
TTGCCGGTGCCGTGGCCCAGGCCTCTCCCCACGGTCTGCAGCTGGGGGTGCAGGCTGTGG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I llllllll 



Query 1018 TTGCCCTGGTCCACCTCTCCAGTCCCCTGCCCCTCACAGAATACATCCAGCCTGTGTGCC 1077 



TTGCCCTGGTCCACCTCTCCAGTCCCCTC 



TCCCAGCTGCCGGCCAGGCCCTGGT 3 3 V riT 1 "GGCTGGGGCA 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



TCCCAGCTGCCGGCCAGGCCCTGGTGGATGGCAAGATCTGTACCGTGACGGGCTGGGGCA 
ACACGCAGTACTATGGCCAACAGGCCGGGGTACTCCAGGAGGCTCGAGTCCCCATAATCA 1197 



I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I. 



ACACGCAGTACTATGGCCAACAGGCCG3 i ,T T '- 1 , - i 3-TCGAGTCCCCATAATCA 



Sbjct 
Sbjct 
Sbjct 



Score = 968 bits (524), Expect = 0.0
Identities = 528/530 (99%), Gaps = 0/530 (0%)
Strand=Plus/Plus 



MMMMMMMI 



M M M I 

3ACTGGG 



MMMMMMMI 



.TCCAGCCTGTGTGCC 



I I I I I I I I I I I I 



MMMMMMMI 



1314 TTTGTGTGTGAGGACAGCATCTCTCGGACGCCACGTTGGCGGCTGTGTGGCATTGTGAGT 

1101 TTTGTGTGTGAGGACAGCATCT^ 

1374 TGGGGCACTGGCTGTGCCCTGGCCCAGAAGCCAGGCGTCTACACCAAAGTCAGTGACTTC

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I 

1161 TGGGGCACTGGCTGTGCCCTGGCCCAGAAGCCAGGCGTCTACACCAAAGTCAGTGACTTC 

1434 CGGGAGTGGATCTTCCAGGCCATAAAGACTCACTCCGAAGCCAGCGGCATGGTGACCCAG
1221 CGGGAGTGGATCTTCCAGGCCATAAAGACTCACTCCGAAGCCAGCGGCATGGTGACCCAG
1494 CTCTGACCGGTGGCTTCTCGCTGCGCAGCCTCCAGGGCCCGAGGTGATCCCGGTGGTGGG
1281 CTCTGACCGGTGGCTTCTCGCTC 



1554 
1341 
1614 
1401 
1674 
1461 



ATCCACGCTGGGCCGAGGATGGGACGT1 



rCTTCTTGGGCCCGGTCCACAGGTCCAAGG 



ATCCACGCTGGGCCTAGGATGGGACGTTT 

ACACCCTCCCTCCAGGGTCCTCTCTTCCACAGTGGCGGGCCCACTCAGCCCCGAGACCAC 

'.MINIMI! 

["TCCACAGTGGCGGGCCCACTCAGCCCCGAG 

CCAACCTCACCCTCCTGACCCCCATGTAAATATTGTTCTGCTGTCTGGGACTCCTGTCTA 
IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMMMMMIIIIIIIIIIIIIII 
CCAACCTCACCCTCCTGACCCCCATGTAAATAT'! "TCCTGTCTA 



Query 1734 GGTGCCCCTGATGATGGGATGCTCTTTAAATAATAAAGATGGTTTTGATT 1783 



^Mi^cM^immmiMiW 15: 



Query 191 GGTCCCACCCTGGCCCAGGAGGTCAGCCAGGGAATCATTAACAAGAGGCAGTGACATGGC 250 



GgIcCCACCcIgGCCCA^ 

GCAGAAGGAGGGTGGCCGGACTGTGCCATGCTGCTCCAGACCCAAGGTGGCAGCTCTCAC 310 

GCAGAAGGAGGGTGGCCGGA 265 

TGCGGGGACCCTGCTACTTCTGACAGCCATCGGGGCGGCATCCTGGGCCATTGTGGCTGT 370

I I I I I I I I I I I I I I I I I I I I I 

TGCGGGGACCCTGCTACTTCTGACAGCCATCGGGGCGGCATCCTGGGCCATTGTGGCTGT 325

TCTCCTCAGGAGTGACCAGGAGCCGCTGTACCCAGTGCAGGTCAGCTCTGCGGACGCTCG 430

MM MM MM III MM I 

TCTCCTCAGGAGTGACCAGGAGCCGCTGTACCCAGTGCAGGTCAGCTCTGCGGACGCTCG 385
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Ml Nil MM III M 



Query 491 CGCCAGGGTAGCCGGACTCAGCTGCGS T C AG 535 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 446 CGCCAGGGTAGCCGGACTCAGCTGCGAGGAGATGGGCTTCCTCAG 490 



>gb|BC072688.1| Rattus norvegicus hepsin, mRNA (cDNA clone MGC:91742 IMAGE:7098661),
complete cds
Length=1580

GENE ID: 29135 Hpn | hepsin [Rattus norvegicus] (10 or fewer PubMed links)



Score = 935 bits (506), Expect = 0.0
Identities = 828/983 (84%), Gaps = 23/983 (2%)
Strand=Plus/Plus

Query 184 ACCCCAGGGTCCCACCCTGGCCCAGGAGGTCAGCCAGGGAATCATTAACAAGAGGCAGTG 243



Sbjct 27 

Query 244 

Sbjct 87 

Query 3 04 

Sbjct 144 



ACCCCAGGGTTCCGCCCCAGCCCAACAGGTC 



ACAT-G-GC- 



ACATGGCGCAGAAGGAGGGT , 1 , T "CAGACCCAAGGTGGCAG 

MM I I I M I ] I I I I I I I I I I I I 

'YTGCTGTTCCAGACCCAAGGTGGCAG 



Sbjct 



203 



CTCTCACTGCGGGGACCCTGCTACTTC-TGAC- 1 T 3 T^CTGGGCCATT 3 62 

MMMMI MMMMMM Ml MMM Ml Mill II MMIMIMM 

CTCTCACTGTGGGGACCCTGCTG-TTCCTGACAGGCATTGGGGCTGCGTCCTGGGCCATT 202

GTGGCTGTTCTCCT-CAGGAGTGACCAGGAGCCGCTGTACCCAGTGCAGGTCAG-CTCTG 420

MM c J c IMM_IMMMM ^ 

CGGACGCTCGGCTCATGGTCTTTGACAAGACGGAAGGGA-CGTGGCGGCTGCTGTGCTCC 479

MM MM II MM II Illlllll II MM Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

-GGACTCTCGACTTTTGGTGTTGGACAAGACAGA-GGGAACGTGGAGGCTGCTGTGCTCC 319

TCGCGCTCCAACGCCAGGGTAGCCGGACTCAGCTGCGAGGAGATGGGCTTCCTCAGGGCA 539
II I I I I I I I I I I I I I I I I I I I I II III MM MM' Illlllll 



Query 600 TGTGTGGACGAGGGGAGG-C 
Sbjct 



TGCGTGGACGAGGGC-GGTCTGCCT 



Query 65! 

Sbjct 49! 

Query 71! 
Sbjct 

Sbjct 

Query 838 

Sbjct 678 

Query 898 



TGATTGCCCCAGAGGCCG1 



rCTTGGCCGr"-T I l ,T ^GCCGCAGGAAGCT 718 



559 



619 



GCCCGTGGACCGCATCGTGGGAGGCCGGGACACCAGCTTGGGCCGGTGGCCGTGGCAAGT 778

Ill Mill Mill Mill MM Mill MM MM I Mill Mill II 

GCCGGTGGATCGCATTGTGGGGGGCCAGGACAGCAGCCTGGGAAGATGGCCATGGCAGGT 618

CAGCCTTCGCTATGATGG-AGCACACCTCTGTGGGGGATCCCTGCTCTCCGGGGACTGGG 837
MMM II Illlllll I I I I I I I I I I t t i f I I I I I I I ] I j 1 I I I I I I I I I I I I I 

CAGCCTGCGTTATGATGGGACC-CACCTCTGTGGGGGATCCCTGCTGTCCGGGGACTGGG 677

TGCTGACAGCCGCCCACTGCTTCCCGGAGCGGAACCGGGTCCTGTCCCGATGGCGAGTGT 897

UIgACCGCTGCACA^^^ 73- 

TTGCCGGTGCCGTGGCCCAGG-CCTCTCCCCACGGTC-TGCAGCTGGGGGTGCAGGCTGT 955

IIgctggIgctgUccU^^^ 79, 



Query 956 GGTCTACCACGGGGGCTATCTTI 



TAAC-AGCGAGGAGAACAGCAACG 1014 



ga!c!a T cUgg^ 
Query 1015 ATATTGCCCTGGTCCACCTCTCCAG-TCCCCTGCCCCTCACAGAATACATCCAGCCTGTG

I I I I I I I I I I I I I I I I I I I I I II MM MM I I I I I I I I I I 1 ] I I I I I I I I III 

Sbjct 855 ACATTGCCCTGGTCCACCTCTCTAGCTCCC-TGCCTCTCACAGAATACATCCAGCCGGTG

Query 1074 TGCCTCCCAGCTGCCGGCCAGGCCCTGGTGGATGGCAAGATCTGTACCGTGACGGGCTGG 

M „ U 1 1 1 „ I 1 11 1 _ 1 1 , II M II M ^^ c ^ 1^ M I, Ml J I_ 1 1 M 1 1 



Query 1134 GGCAACACGCAGTACTATGGCCA 1156 



Score = 265 bits (143) 
Identities = 405/525 (7' 
Strand=Plus/Plus 



3-ACGCCACGT 13 4 9 
Sbjct 



Sbjct 



'-GGAACATCAAGA 1051 

TGGCGGCTGTGTGGCATTGTGAGTTGGGGCACTGGCTGTGCCCTGGCCCAGAAGCCAGGC 1409 



GAAGCTACCGGCATGGTAACTCAGCCCTGACCCCGCCTCATCGCCTGCTCCGCGCTGCTC 
- AGCCTCCAGGGCCCGAG- -G-T- -GAT- -C-CCGGT-G GTGGGA- - -TCCACGCTG 



MINI II MINIM II 



MM II MMMM 1 1 



MMMMIMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



iGCATCCAGAGTCAGAGTTGGTCTGGTGGCTCCAC 



iGCCGCACGTGGCAGGCTCCAC 



GGCCTCAC-ATGGAACGGTTTTCTGCTCGGATCCAGTCCATAGAT( 



CACACTG 



1231 
1563 
1291 



III II I 1 1 1 1 1 1 1 II II II II II I MM MM I Ml I I Ml r 



ACCCTCCTGACCCCCATGTAAATATTGTTCTG-CTGTCTGGGA-CTCCTGTCTAGGTGCC 

::::m imm mimiiimi mm i immi ii ii ii mi mi 

ACCCTCCC-ACCCC-ATGTAAATATTACTCTGTCC-TCTGGGGGCTGCTTTCGAGGCGCC 
CCTGATGATG-GGATGCTCTTTAAATAATAAAGATGGTTTTGATT 1783 

" II II I I I I I I I I I I I I I I I I I I I I I I MMMM 

--TG-TGCGGATGCTCTTTAAATAATAAAGGTGGTTTTGATT 1505 



Length=1887 
GENE ID: 15451 Hpn | hepsin [Mus musculus] 



Score = 880 bits (476), Expect = 0.0 
Identities = 944/1164 (81%), Gaps = 56/1164 (4%) 
Strand=Plus/Plus 



Query 658 

Sbjct 720 

Sbjct 780 

Query 777 
Sbjct 



839 



GTGATTGCCCCAGAGGCCGTTTCTTGGCCGCCATCTGCCAAGACTGTGGCCGCAGGAAGC 717 

MIMMMIMM^ 7 „ 

TGCCCGTGGACCGCATCGTGGGAGGCCGGGACACCAG-CTTGGGCCGGTGGCCGTGGCAA 776 
MM MMIMIMI Mill MM Mill III II III I I I I I I I I I I I I I 
TGCCGGTGGACCGCATTGTGGGGGGCCAGGACAGCAGTCT-GGGAAGGTGGCCGTGGCAG 838 

GTCAGCCTTCGCTATGATGG-AGCACACCTCTGTGGGGGATCCCTGCTCTCCGGGGACTG 835 

MMMM II MMMM I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMM II MMMM 

GTCAGCCTGCGTTATGATGGGACC-CACCTCTGTGGGGGGTCCCTGCTGTCTGGGGACTG 897 

GGTGCTGACAGCCGCCCACTGCTTCCCGGAGCGGAACCGGGTCCTGTCCCGATGGCGAGT 895 

GGTGCTGACTGCTGCACATTC 957 

Query 896 GTTTGCCGGTGCCGTGGCCCAGG-CCTCTCCCCACGGTCTGCAGCTGGGGGTGCAGGCTG 954 
Sbjct 958 ATTTGCTGGTGCTGTAGCCC-GGACCTCACCCCATGCTGTGCAACTGGGGGTTCAGGCTG 
Query 955 TGGTCTACCACGGGGGCTATCTTCCCTTTCGGGACCCCAAC-AGCGAGGAGAACAGCAAC 

II MM II MMMM MMIMIMI Mill III II II MMMM 

Sbjct 1017 TGATCTATCATGGGGGCTACCTTCCCTTTCGAGACCCTA-CTATTGACGAAAACAGCAAT 
Query 1014 GATATTGCCCTGGTCCACCTCTCCAG-TCCCCTGCCCCTCACAGAATACATCCAGCCTGT 

II IMMI 1 1 1 1 1 1 1 1 1 1 1 1 1 II MM MM MMMM II 

Sbjct 1076 GACATTGCCTTGGTCCACCTCTCTAGCTCCC-TGCCTCTCACAGAATACATCCAGCCAGT 

"'llMMMM^fltm^ 

Query 1133 GGGCAACACGCAGTACTATGGCCAACAGGCCGGGGTACTCCAGGAGGCTCGAGTCCCCAT 

1 1 1 M^^^ 

Query 1193 AATCAGCAATGATGTCTGCAATGGCGCTGACTTCTATGGAAACCAGATCAAGCCCAAGAT 
II Mill II II Mill II I MMMM II II I I I I I I I I I I I I I I I I I 
Sbjct 1255 CATAAGCAACGAAGTTTGCAACAGCCCCGACTTCTACGGGAATCAGATCAAGCCCAAGAT 

Query 1253 GTTCTGTGCTGGCTACCCCGAGGGTGGCATTGATGCCTGCCAGGGCGACAGCGGTGGTCC 
I I I I I I I I I I I I I I I II : I I I I I MMMIMIMM II II II 

Sbjct 1315 GTTCTGTGCTGGCTATCCTGAGGGTGGCATTGATGCGTGCCAGGGCGACAGTGGAGGCCC 



MMMIIMM MMMIIMM MM II I MMMM MMIMIMI II 
1373 TTGGGGCACTGGCTGTGCCCTGGCC3-3- 3T3AGTGACTT 

Mill II llllllll MINI MINIMI II Mi' llllll 

1435 CTGGGGTACCGGCTGTGCTTTGGCCCGGAAGCCAGGAGTGTACACCAAAGTCACTGACTT 



I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I i MIIIIMM 



GCTCTGA-CCGG- -TGG-CT- - -T- CTC-G- -CTGCGC-AGCCTCCAGGGCCCGAG- -G- 
GCCCTGATCCCGCCTCATCTCGCTGCTCCGTGCTGCACTAGCATC 

T - - GAT --C-CC-G GTGGTGGGATCCACGCTGGGCCG - AGGATGGGACGTTTTT 

I II I II I Mill II Mill Mill I MM I I III 

TCTGGTGGCTCCAGCCCCACGTGGTAGGCTCCACACTGGGCCTCAC-ATGGAATGGTTTC 



TATTGTTCTG - CTGTCTGGGACTC - CTGTCTAGGT - GCCCCTGATGATGG - GATGCTCTT 
TATTACTCTGTCC - TCTGGGGGGCGCT - -CTAGGGAGCCCCT- -TG-TGCAGATGCTCTT 



1760 TAAATAATAAAGATGGTTTTGATT 1783 
1842 liAiliiliiiUGGllllGill 186 5 



Strand=Plus/Plus 



ACCCCAGGGTCCCACCCTGGCCCAGGAGGTCAGCCAGGGAATCATTAACAAGAGGCAGTG 
ACCCCAGGGTTCCGCCCCA^ 



Sbjct 203 ACAT-G-GC-GAAGGAGGGTGGCCGGACTGCAG-CATGCTGCTCCAGACCCAAGGTGGCA 258 
Query 303 GCTCTCACTGCGGGGACCCTGCTACTTC-TGACAGCCATCGGGGCGGCATCCTGGGCCAT 361 



I 51 33T3TCATTGTGGGTACCCTGCTG- 



CAGGAGTGAC CAGGi 



7 7 TCCTCGCGCTCCAACGCCAGGGTAGCCGGACT3- GCT 3C< 3A 3 A"-_GATGGGCTTCCTCAGG 53 6 

Mill llllllll llllllll II II III MM MIIIIIMIMM llllll 

3 3 TCCTCACGCTCCAATGCCAGGGTGGCAGGGCTCGGCTGTGAGGAGATGGGCTTTCTCAGG 4 92 

CACTGACCCACTCCG 

493 GCTCTGGCGCACTCGGAGCTGGATGTGCGCACTGCGGGCGCCAACGGCACATCGGGCTTC 552 

- - 3-CTGCCCCACACCCAGAGGCTGCTGGAGGTCATCTC 652 

TTTTGCGTGGACGAGGGC-GGACTGCCTCTGGCTCAGAGGTTGCTGGATGTCATCTC 608 



T- 



ACGTGGCGGCTGCTGTGC 



■ GGGTACGTGGAGGCT ACTGTGC 



>ref|XM_001254640.1| PREDICTED: Bos taurus similar to hepsin (LOC7871C 
mRNA 

Length=779 

GENE ID: 787164 LOC787164 | similar to hepsin [Bos taurus] 



Score = 815 bits (441), Expect = 0 
Identities = 662/761 ■ ----- 

Strand=Plus/Plus 



Query 1052 
Sbjct 34 
Query 1112 



, Gaps = 46/761 (6%) 

i??I?????tt?t?T < M?It?It T TT??tt < Mfi? c ????r?I 



MMMMMMMMMMMIMM III I llllllll II 



AAACCAGATCAAGCCCAAGATGTTCTGTGCTGGCTACCCCGAGGGTGGCATTGATGCCTG 
'HIM Mill 
Sbjct 



Query 1528 



1352 GCGGCTGTGTGGCATTGTG-3T ' - T T I -T33CCCAGAAGCCAGGCGT 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GCGGCTGTGTGGCATTGTGAGCTGGGGCAC 



.CCGGCTGTGCCC 1 : 

CTACACCAAAGTCAGTGACTTCCGGGAGTGGATCTTCCAGGCCATAAAGACTCACTCCGA 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



CTACACCAAAGTCAGTGACTTCCGGGAGTGGATCTTCCAGGCCATAAAGACTCACTCCGA 



AGCCAGCGGCATGGTAACCCAGCTTT 3 3TCGCTGTGCACGCCTCCA 



llllllllllllll MINIM II I I I I 

! - — " GCTTTGACCTGTGG 

-CC---G G 

III I 



GGGCCCGAGCTGATCTAAGGGGCCCCAGCCCC-r3T i i 3TGGGCC - AGG 



Query 1572 -ATGGGACGTTTTTCTTCTTGGG^ 

UggItIctcttccctccIac 
gtcctctcttccacagtggcgggcccactcagccccgagaccacccaacctcaccctcct 



I I I I I I I I I I I I I I I I I I I I I I I 

- -ACAGTGGCG 3GGACCACCC- 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill.. . 

■-CCCCCATGTAAATATTGTTCTGCCATCTGGGATCCCCCCCCCCCATCTTG-TGCTCCT 



Sbjct 



I I I I I I I I I I I I I I I I I I I I 

:ctggcccagaagccaggcgj 



I I I I I I I I I I I I I I I I I I I I I I 



--TC-C--TGC- 



GAAGACAGGATGCTCTTTAAATAATAAAGATG' 



.48. 1| Mus musculus cDNA clone IMAGE : 4004431' 



Strand=Plus/Minus 
Query 448 
Sbjct 761 



Query 626 



Query 686 



III III III I I I I I I I I I I I I I I I I I I I I ! I M i I I I i I I I I I I II II 

AGA-GGATGGGAACGTGGAGGCTACTGTGCTCCTCACGCTCCAATGCCAGGGTGGCAGGG 
CTCAGCTGCGAGGAGATGGGCTTCCTCAGGGCACTGACCCACTCCGAGCTGGACGTGCGA 

III MM llllllllllllll 1 1 1 1 1 1 1 1 Ml I MM! Illlllll Mill 

CTCGGCTGTGAGGAGATGGGCTTTCTCAGGGCTCTGGCACACTCGGAGCTGGATGTGCGC 
ACGGCGGGCGCCAATGGCACGTCGGGCTTCTTCTGTGTGGACGAGGGGAGG-CTGCCCCA 

II MMIMIMI Mill MMIMIMI II I Ml II Mill I 

ACTGCGGGCGCCAACGGCACATCGGGCTTCTTTTGCGTGGACGA-GGGCGGACTGCCTCT 

c ???t T ?I???tt?t?I?I??????t??tt??I??? c ?ITTt????tf?I??? A ???? G 



Sbjct 
Sbjct 
Sbjct 
Sbjct 



GGACAGCAGTCT-C 

CTCTGTGGGGGATCCCTGCTCTCCGGGGACTGGGTGCTGACAGCCGCCCACTGCTTCCCG 863 

c!c!g!gGGgUcCc!gcUc T GGGg1c^^ 346 

GAGCGGAACCGGGTCCTGTCCCGATGGCGAGTGTTTGCCGGTGCCGTGGCCCAGG-CCTC 922 

GAGCGGAACCGGGTCCTOT 287 

TCCCCACGGTCTGCAGCTGGGGGTGCAGGCTGTGGTCTACCACGGGGGCTATCTTCCCTT 982 

Mill I I MM Illlllll MM MM I I Illlllll Illlllll 

ACCCCATGCTGTGCAACTGGGGGTTCAGGCTGTGATCTATCATGGGGGCTACCTTCCCTT 227 

TCGGGACCCCAAC-AGCGAGGAGAACAGCAACGATATTGCCCTGGTCCACCTCTCCAG-T 1040 
III Mill I I I III II Illlllll M MMM Ml' II I 

TCGAGACCCTA-CTATCGACGAAAACAGCAATGACATTGCCTTGGTCCACCTCTCTAGCT 168 



rTCTCACAGAATACATCCA^ 

Query 1101 GTGGATGGCAAGATCTGTACCGTGACGGGCTGGGGCAACACGCAGTACTATGGCCAACAG 1160 
Sbjct 108 GTGGATGGCAAGGTCTGTACTGTGACCGGCTGGGG 



GCTATGGTGCTCCAAGAGGCCCGGGTTCCCAT 



>gb|BC119449.1| Mus n 
Strand=Plu: 



Sbjct 702 

Query 567 

Sbjct 642 

Query 626 

Sbjct 583 

Query 686 

Sbjct 523 

Query 746 



448 AGACGGAAGGG - ACGTG 3^ j 1 [ i 1 T T r.-GGGTAGCCGGA 506 

III III III llllll I I I I HI Ml IIIIIMI || || 

761 AGA-GGATGGGAACGTGGAGGCTACTGTGCTCCTCACGCTCCAATGCCAGGGTGGCAGGG 703 

507 CTCAGCTGCGAGGAGATGGGCTTCCTCAGGGCACTGACCCACTCCGAGCTGGACGTGCGA 566 

III I I I I llllllllllllll IIIIIMI III I Mill IMIIIII Mill 

702 CTCGGCTGTGAGGAGATGGGCTTTCTCAGGGCTCTGGCACACTCGGAGCTGGATGTGCGC 643 



ACTGCGGGCGCCAACGGCACATCGG j 1 [ i 1 1 33- 3GA-GGGCGGACTGCCTCT 584 



GGACACCAG 
Mill III II 
GGACAGCAGTCT ■ 



GGCTCAGAGGTTGCTGGATGTCATCTCTGTATGTGACTGTCCTAGAGGCCGATTCCTGAC 524 

CGCCATCTGCCAAGACTGTGGCCGCAGGAAGCTGCCCGTGGACCGCATCGTGGGAGGCCG 745 

MM I I I I I I I I I I I I II I II II II II II II II I I I I I I I I I I I I Mill MM 

TGCCACCTGCCAAGACTGTGGCCGCAGGAAGCTGCCGGTGGACCGCATTGTGGGGGGCCA 464 



CTTGGGCCG 7TATGATGG- 



•GGGAAGGTGGCCGTGGCAGGTCAG '7T 3 ' 3 T'TATGATGGGACC ■ 




GAGCGGAACCGGGTCCTGTCTCGGT 

TCCCCACGGTCTGCAGCTGGGGGTGCAGGCTGTGGTCTACCACGGGGGCTATCTTCCCTT 



3T33kT333kk3kT3T3Tk333T3k33333T33333kk3k333k3TkCTkT3333kk3k3 

MMMIIMM lllllll Mill IIIIIMI Mill MM I 

JT ; T33^AAGGTCTGTACTGTGACCGGCTGGGGTAACACACAGTTC 

GCCGGGGTACTCCAGGAGGCTCGAGTCCCCAT 
II 

GCTATC 



\TGGTGCTCCAAGAGGCCCGGGTTCCCAT 



>gb|AC192150. 

sequence 
Length=2130i: 



Identities^ 
Strand=Plus/E 

Query 1459 



• Pan troglodytes BAC clone CH251-522E19 frc 






AGACTCACTCCGAAGCCAGCGGCAT 3 ,1 33T3T3.- T DT3T33 TGCG 1518 

AGACTCACTCCGAAGCCAGCGGC 17919, 

CAGCCTCCAGGGCCCGAGGTGATCCCGGTGGTGGGATCCACGCTGGGCCGAGGATGGGAC 1578 

CAGCCTCCAGGGCCCGAGGTGATCCCGGTGGT 17925, 



Query 1579 



Sbjct 



TCCACAGTGGCGGGCCCACTCAG 7 ? 7 7 7-_-_ 7-_-_ 7 7A 7 7 7AA 7 ' 7 AT 16 9 8 

11 1 J I J U I I I I I I I I I I M I I I II II I I I J II I II I I I I Ml I II M I M I I I II II ,M 



1699 

179373 

1759 



GTAAATATTGTTCTGCTGTCTGGGACTCCTGTCTAGGTGCCCCTGATGATGGGATGCTCT 1758 



GTAAATATTGTTCT 
TTAAATAATAAAGATGGTTTTGATT 

!!iii!ii!AAiGi!GG!!!!Gi!! 
Strand=Plus/Plus 
Query I 



Sbjct 



(98%), Gaps = 3/202 (1%) 



17368S 

1035 

17374S 



lllllllllllllllll 'M MM MM III' 



CTTCCCTTTCGGGACCCCAACAGCGAGGAGAACAGCAACGATATTGCCCTGGTCCACCTC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CTTCCCTTTCGGGACCCCAACAGCGAGGAGAACAGCAACGATATTGCCCTGGTCCACCTC 
TCCAGTCCCCTGCCCCTCACAG 1056 



TCCAGTCCCCTGCCCCTCACAG 



Strand=Plus/Plus 



CCTGCCAGGGC ' ir r 3"~TCTCTCGGACGCCAC 

Ml MMMMMMMMMMMMMMMIMMMMMMMMMMMMI 

CCTCCCAGGGCGACAGCGGTGGTCC-TT T "C-TCTCTCGGACGCCAC 



GTTGGCGGCTGTGTGGCATTGTGAGTTGG 3 3 rG 1 T , "T^GCCCAGAAGCCAG 



1866 GCGTCTACACCAAAGTCAGTGACTTCCGG3- ,T i 1 n '-33CCATAAAG 



Sbjct 
Sbjct 
Sbjct 

Score = 313 bits (169), Expect = 2e-81 
Identities = 172/173 (99%), Gaps = 1/173 (0%) 
Strand=Plus/Plus 

Query 695 

Sbjct 173295 CCACAGACTGTGGCCGCAGGAAGCTGCTC 173 3! 
Query 754 GCTTGGGCCGGTGGCCGTGGCAAGTCAGCCTTCGCTATGATGGAGCACACCTCTGTGGGG 813 



TGCTTCCCGGAG 866 



M M M M M M M M M M M M 

3415 GATCC3TG IT ' 1 i i 3ACTGGGTGCTGACAGCC 



Identities 
Strand=Plu: 



:CGCCCACTGCTTCCCGGAG 



CAGGTGAGGCAGCCTGGCCTAGCJ-G , I T T ^CAGGCCGCCCG 97 

M M M M M M M M M M M M M II M M M M M M M M M M M I 

ijDjcc I5lb«u L.ii(j^rl\jiitjtjCAGCCTGGCCTAGCAGGCCCCACGCCACCGCCTCTGCCTCCAGGCCGCCCG 154" 

Query 98 CTGCTGCGGGGCCACCATGCTCCTGCCCAGGCCTGGAGACTGACCCGACCCCGGCACTAC 157 

' 

Sbjct 154740 CTGCTGCGGGGCCACCATGCTCCTGCCCAGGCCTGGAGACTGACCCGACCCCGGCACTAC 154: 
Query 158 CTCGAGGCTCCGCCCCCACCTGCTGGACCCCAGGGT 193 

Sbjct 154800 C^GiGGclciGCCCCcicclGC^IMlGGG! 154835 



Strand=Plus/Plus 



lllllllllllllllllllllllll. 



Query 1272 

lllllllllllllllllllllllll 
Sbjct 178545 GAGGGTGGCATTGATGCCTGCCAGG 178569 



Strand=Plus/Plus 



1271 
178544 



172624 CAGTGCAGGTCAGCTCTGCGGACGC7 Z 3 3_-_ :AAGACGGAAGGGACGT 
Query 593 
Sbjct 172885 



Query 653 CGTGTG-TGA 
MINI III 
Sbjct 172945 CGTGTGGTGA 



TGGGCTTCCTCAGG 



CAGGGCACTGACCCACTCCGAGCTGGACGTGCGAACGGCGGGCGCCAATGGCACGTCGGG 592 
llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
CAGGGCACTGACCCACTCCGAGCTGGACGTGCGAACGGCGGGCGCCAATGGCACGTCGGG 172884 

172944 



Identities^ 10: 
Strand=Plus/Plu: 



GGTGGCCGGACTGTGCCATGCTGCTC i i GCTCTCACTGCGGGGACC 



Sbjct 162198 CTGCTACTTCTGACAGCCATCGGGGCGGCATCCTGGGCCATTG 162240 

Score = 189 bits (102), Expect = 4e-44 
Identities = 102/102 (100%), Gaps = 0/102 (0%) 
Strand=Plus/Plus 

zz -132 ssnissnsisinnsi^ zl 

Query 1112 GATCTGTACCGTGACGGGCTGGGGCAACACGCAGTACTATGG 1153 

Sbjct 173192 GATCTGTACCGTGACGGGCTG^ 1782 33 

Score = 137 bits (74), Expect = 1e-28 
Identities = 74/74 (100%), Gaps = 0/74 (0%) 
Strand=Plus/Plus 



1 5 5 413 GGTCCCACCCTGGCCCAGGAGOT 



Query 191 GGTCCCACCCTGGCCCAGGAGGTCAGCCAGGGAATCATTAACAAGAGGCAGTGACATGGC 250 

Sbjct 1554: 
Query 251 



Score = 80.5 bits 
Identities = 43/43 
Strand=Plus/Plus 

Query 363 - _. _ 

Sbjct 162323 ™ ' 



clone LIVER2 0I 



Strand=Plus/Plus 
Query 658 



Score = 590 bits (319) , 
Identities = 323/325 (9? 
Strand=Plus/Plus 



2223 AGACTCACTCCGAAGCCAGCGGCATGGTGACCCAGCTCTGACCGGTGGCTTCTCGCTGCG 2282 
1519 CAGCCTCCAGGGCCCGAGGTGATCCCGGTGGTGGGATCCACGCTGGGCCGAGGATGGGAC 1578 



Sbjct 



GTTTTTCTTCTTGGGCCCGGTCCACAGGTCCAAGGACACCCTCCCTCCAGGGTCCTCTCT 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

GTTTTTCTTCTTGGGCCCGGTCCACAGGTCCAAGGACACCCTCCCTCCAGGGTCCTCTCT 



2463 
1759 
2523 



MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 



GTAAATATTGTTCTGCTGTCTGGGACTCCTGTCTAGGTGCCCCTGATGATGGGATGCTCT 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I MMMMM 

GTAAATATTGTTCTGCTGTCTGGGACT '_ 3ATGACGGGATGCTCT 

TTAAATAATAAAGATGGTTTTGATT 1783 



TTAAATAATAAAGATGGTTTTGATT 



Strand=Plus/Plus 



Sbjct 
Sbjct 
Sbjct 



CCTGCCAGGGCGACAGCG jT l TTTG r T T :GGACGCCAC 

Ml MMMMMMMMMMMMMMMMIMMMMMMMMMMMI 

CCTCCCAGGGCGACAGCGGT T 1 , T T -TCGGACGCCAC 



GCGTCTACACCAAAGTCAGTGACTTCCGGGAGTGGATCTTCCAGGCCATAAAG 2008 



Score = 268 bits (145) , 
Identities = 145/145 (1C 
Strand=Plus/Plus 

Query 1152 GGCCAACAGGCCGGGGTACTCCAGGAGGCTCGAGTCCCCATAATCAGCAATGATGTCTGC 1211 
Sbjct 1514 GGCCAACAGGCCGG^ 1573 
Query 1212 AATGGCGCTGACTTCTATGGAAACCAGATCAAGCCCAAGATGTTCTGTGCTGGCTACCCC 1271 

M M M M M M M M M M M M M M M M M M M MM M M M M M M M M M 

Sbjct 1574 AATGGCGCTGACTTCTATGGAAACCAGATCAAGCCCAAGATGTTCTGTGCTGGCTACCCC 1633 
Query 1272 GAGGGTGGCATTGATGCCTGCCAGG 1296 



Identities = 104/104 
Strand=Plus/Plus 

Query 1050 CTCACAGAATACATCCAGCCTGTGTGCCTCCCAGCTGCCGGCCAGGCCCTGGTGGATGGC 

M M M M M M M M M M M M M I I I I I M I M I ! M I I I I M M I I I I M M M M 

Sbjct 1219 ~ " - ' ' ' " ■ ■ - 

TlTTIT TTTTTTTT I'M Mil TTTTTmT Ff TT NT TTTTTT TT 



>gb|AC020907.6| Homo sapiens chromosome 19 clone CTD-2527I21, complete sequence 
Length=169891 






Score = 590 bit 
Strand=Plus/Plu: 

Query 1459 

Sbjct 79750 

Query 1519 



AGACICACICOGAA^^ 79809 

CAGCCTCCAGGGCCCGAGGTGATCCCGGTGGTGGGATCCACGCTGGGCCGAGGATGGGAC 1578 

CAGCCTCCAGGGCCCGAGGTGATCCCGGT 79869 

GTTTTTCTTCTTGGGCCCGGTCCACAGGTCCAAGGACACCCTCCCTCCAGGGTCCTCTCT 1638 

TCCACAGTGGCGGGCCCACTCAGCCCCGAGACCACCCAACCTCACCCTCCTGACCCCCAT 1698 

M M M M M M M M M M M M M M M M MM MM M M M M M M M M M M 

TCCACAGTGGCGGGCCCACTCAGCCCCGAGACCACCCAACCTCACCCTCCTGACCCCCAT 79989 
Sbjct 



1759 TTAAATAATAAAGATGGTTTTGATT 1783 

I I I I I I I I I I I I I I I I I I I I I I I I I 
80050 TTAAATAATAAAGATGGTTTTGATT 80074 



Score = 353 bits 
Identities = 199/ 
Strand=Plus/Plus 

Query 858 

Sbjct 

Sbjct 74179 

Query 975 

Sbjct 74239 

Query 1035 




CTTCCCTTTCGGGACCCCAACAGCGAGGi 
TCCAGTCCCCTGCCCCTCACAG 1056 



Sbjct 



7429! 



Sbjct 79423 



TCCAGTCCCCTGCCCCTCACAG 74320 

bits (170), Expect = 6e-82 
: 172/173 (99%) , Gaps = 0/173 (0%) 
Plus 

CCTGCCAGGGCGACAGCGGTGGTCCCTTTGTGTGTGAGGACAGCATCTCTCGGACGCCAC 

; : 

93 6 3 CCTCCCAGGGCGACAGCGGTGGTCCCTTTGTGTGTGAGGACAGCATCTCTCGGACGCCAC 



.'CAGCCTTCGCTATGATGGAGCACACCTCTG' 



T T T T T T 3 T ^GGAG 866 

INNNIMIIIIIIIIMI^ ?401e 



1347 
79422 



lllllllllllllll llllllllllllll 



14 3 '3T -T- ' 3T 31 IT 1 , , - 3T 3GATCTTCCAGGCCATAAAG 1460 

lllllllllllllllllllllllllllllllllllllllllllllllllllll 

Sbjct 79483 GCGTCTACACCAAAGTCAGTGACTTCCGGGAGTGGATCTTCCAGGCCATAAAG 79535 

Score = 313 bits (169), Expect = 2e-81 
Strand=Plus/Plus 

Query 695 CCA-AGACTGTGGCCGCAGGAAGCTGCCCGTGGACCGCATCGTGGGAGGCCGGGACACCA 

III llllllllllllll 
Sbjct 73844 CCACAGACTGTGGCCGCAGGAAGCTGCCCGTGGACCGCATCGTGGGAGGCCGGGACACCA 

Query 754 GCTTGGGCCGGTGGCCGTGGCAAGTCAGCCTTCGCT; 

Sbjct 73004 GCTTGGGCCGGTGGCCGTGGCAAGTCAGCCTTCG 



I I I I I 

;tgggg 



Strand=Plus/Plus 



CAGGTGAGGCAGCCTGGCCTAGCAGG" 1 I 1 -GGCCGCCCG 
llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
-"""GCAGCCTGGCCTAGCAGGCCCCACGCCACCGCCTCTGCCTCCAGGCCGCCCG 



55222 CAGGTGAGGC 



CTGCTGCGGGGCCACCATG T I , TGGAGAC1 7 CGGCACTAC 

- MM Ml i 

CTGCTGCGGGGCCACCATGCTCCTGCCCAGGCCTGGAGACTGACCCGACCCCGGCACTAC 
CTCGAGGCTCCGCCCCCACCTGCTGGACCCCAGGGT 193 
CTCGAGGCTCCGCCCCCACCT""""" ' "~ ' ~™ 



Strand=Plus/Plus 

Query 1152 GGCCAACAGGCCGGGGTACTCCAGGAGGCTCGAGTCCCCATAATCAGCAATGATGTCTGC 
Sbjct 79041 GGCCAACAGGCCGGGGTACTCCAG 

Query 1212 AATGGCGCTGACTTCTATGGAAACCAGATCAAGCCCAAGATGTTCTGTGCTGGCTACCCC 

Sbjct 79101 AATGGCGCTGACTTCTATGGAAACCAGATCAAG 

Query 1272 GAGGGTGGCATTGATGCCTGCCAGG 1296 

lllllllllllllllllllllllll 
Sbjct 79161 GAGGGTGGCATTGATGCCTGCCAGG 79185 

Score = 248 bits (134), Expect = 6e-62 
Identities = 134/134 (100%), Gaps = 0/134 (0%) 
Strand=Plus/Plus 



79100 
1271 
79160 
Query 523 
Sbjct 73293 



Score = 233 bits (12 
Identities = 129/130 
Strand=Plus/Plus 



Sbjct 73374 
Query 593 



CTTCTTCTGTGTGGACGAGGGGAGGCTG ' - - . 1 'T i 3.-.GGTCATCTC 

MMMMMMMMMMMMMMMMIMMMMMIMIMM 

CTTCTTCTGTGTGGACGAGGGGAGGC". 7TGCTGGAGGTCATCTC 



Query 653 CGTGTG-TGA 



Sbjct 73494 CGTGTGGTGA 



Score = 193 bits (104 
Identities = 104/104 0 
Strand=Plus/Plus 



Sbjct 78746 
Sbjct 78806 



cIcacaga! 
aagatctgt 

MINIMI 



Score = 191 bits (103 
Identities = 103/103 ( 
Strand=Plus/Plus 



I II II II II 

CTGCTACTT' 



4gctctgcggacgctcggctcat(Utctttga 

j??I??I?????I??tt?T??tfMlt???fit?!?t??I???t??t?t 



536 
73306 



ccgtgacgggctggggcaacacgcagtactatgg 

MIIIIMI 



AAGATCTGTACCGTGACGGGCTGGGGCAACACGCAGTACTATGG 78849 



0/103 



261 GGT, r I F J ,T I I - 320 

62792 GGTGGCCGGACTGTGCCATGCTGCTCCAGACCCAAGGTGGCAGCTCTCACTGCGGGGACC 62851 
CTGCTACTTCTGACAGCCATCGGGGCGGCATCCTGGGCCATTG 363 



IIIIIIIIIIIIIIIIIIIIMMMMMMI 

:tgacagccatgggggggggatggtgggggattg 



Strand=Plus/Plus 

Query 191 GGTCCCACCCTGGCCCAGGAGGTCAGCCAGGGAATCATTAACAAGAGGCAGTGACATGGC 

Sbjct 55955 iiWJiMMMM^ 
Query 251 GCAGAAGGAGGGTG 264 



Score = 82.4 bits 
Identities = 44/4 
Strand=Plus/Plus 



I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I 

TCGAGCCCGCTTTCCAGGGACCCT-. 3A 54052 



Strand=Plus/Plus 



GTGGCTGTTCTCCTCAGGAGTGACCAGGAGCCGCTGTACCCAG 405 

:::::::: 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [i 1 1 1 1 1 1 1 1 1 1 1 1 1 

GTGGCTGTTCTCCTCAGGAGTGACCAGGAGCCGCTGTACCCAG 63019 



Score = 78.7 bits 
Identities = 42/42 
Strand=Plus/Plus 

Query 658 GTGATTGCCCCAGAGGCCGTTTCTTGGCCGCCATCTGCCAAG 699 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



73631 GTGATTGCCCCAGAGG::3TTT Tj :;ATCTGCCAAG 73672 
>dbj|AK125670.1| Homo sapiens cDNA FLJ43682 fi: 
to SERINE PROTEASE HEPSIN (EC 3.4.21.-) 
Length=2831 



clone TBAES200: 



, weakly similai 






-MINIMI' 

TTCCCTGGTAGGCGGAACCGGGTCCTGTCCCGATGGCGAGTGTTTGCCGGTGCCGTGGCC 1496 
CAGGCCTCTCCCCACGGTCTGCAGCTGGGGGTGCAGGCTGTGGTCTACCACGGGGGCTAT 974 



CAGGCCTCTCCCCACGGK 



Sbjct 



aacaM 

i 1 1 mt? 



:gatattgccctggtccacctc 
.11 III Ml I I I I M I M I M I 



1557 CTTCCCTTTCGGGACCCCAACAGCGAGGAGAACAGCAACGATATTGCCCTGGTCCACCTC 1616 



Score = 313 bits (16! 
Identities = 172/173 
Strand=Plus/Plus 



CCA-AGACTGTGGCCGCAGGAAGCTGCCCGTGGACCGCATCGTGGGAGGCCGGGACACCA 



Jlllllllllllllllll 



GCTTGGGCCGGTGGCCGTGGCAAGTCAGCCTTCGCTATGATGGAGCACACCTCTGTGGGG 813 



I I I I I I I I I I I I I I I I I I 



Sbjct 1222 GCTTGGGCCGGTGGCCGTGGCAAGTCAGCCTTCGCTATGATGGAGCACACCTCTGTGGGG 1281 
Query 814 GATCCCTGCTCTCCGGGGACTGGGTGCTGACAGCCGCCCACTGCTTCCCGGAG 866 



Score = 248 bits (134), Expect = 6e-62 
Identities = 134/134 (100%), Gaps = 0/134 (0%) 
Strand=Plus/Plus 



Query 403 CAGTGCAGGTCAGCTCTGCGGACGCTCGGCTCATGGTCTTTGACAAGACGGAAGGGACGT 462 



CAGTGCAGGTCAGCTCTGCGGACGCTTO 



llllllllllllllllllllllllll 



Ml 



i 1,-7- , "TGCGAGGAGA 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
GGCGGCTGCTGTGCTCCTCGCGCTCCAACGCCAGGGTAGCCGGACTCAGCTGCGAGGAGA 

TGGGCTTCCTCAGG 536 
I I I I I I I I I I I I I I 
TGGGCTTCCTCAGG 623 



Identities" 129/ 
Strand=Plus/Plus 



Query 533 CAGGGCACTGACCCACTCCGAGCTGGACGT 3 1 1 ~_~TGGCACGTCGGG 592 

CAGGGCACTGACCCACTCCGAGCTGGACGTGCG 750 



Sbjct 691 
Query 593 



CTTCTTCTGTGTGGACGAGGGGAGGCTGCCCC^ 



Query 653 CGTGTG-TGA 661 
Sbjct 811 820 



>gb|DQ677665.1| SSS I 

gene, complete cds 
Length=15819 



channel beta- 



Gaps = 0/156 (0%) 



CAGGTGAGGCAGCCTGGCCTAGC-33 J 3T33-G3CCGCCCG 97 

II MM MM MM III I 

JCCCCACGCCACCGCCTCTC 
CTGCTGCGGGGCCACCATGCTCCTGCCCAGGCCTGGAGACTGACCCGACCCCGGCACTAC 
iiiiii ii iiiiiJ iiJiiiiiiiiiiiJ I iiiii I J I J iiil^i^^icTAC 



1415 0 CTGCTGCGGGGCCACCATGCTCCTGCCCAGGCCTGGAGACTGACCC 



MMMMMMMMMMMMMMMMMII 



Strand=Plus/Plus 



GGTCCCACCCTGGCCCAGGAGGTCAGCCAGGGAATCATTAACAAGAGGCAGTGACATGGC 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 

GGTCCCACCCTGGCCCAGGAGGTCAGCCAGGGAATCATTAACAAGAGGCAGTGACATGGC 
GCAGAAGGAGGGTG 264 



Score = 82.4 bit 
Identities = 44/ 
Strand=Plus/Plus 



TCGAGCCCGCTTTCCAGGGACCCTACCTGAGGGCCCACAGGTGA 

MIMMMMIIIIIIIIMlim 



>gb|AC197610.3| 

sequence 
Length=158733 



: MACACA MULATTA BAC clone CH250-348G8 frc 






Score = 250 bits (135 
Identities = 149/156 
Strand=Plus/Plus 



Query 38 
Sbjct 149463 



CAGGTGAGGCAGCCTGGCCTAGCAGGCCCCACGCCACCGCCTCTGCCTCCAGGCCGCCCG 



Query 98 
Sbjct 149523 



149522 

CTGCTGCGGGGCCACCATGCTCCTG 157 

149582 



cIgcIgogggg^^ 

CTCGAGGCTCCGCCCCCACCTGCTGGACCCCAGGGT 193 
ctccaggctccgccctcacctgccggactc 14 96] 



Identities = 
Strand=Plus/P 

Query 261 

Sbjct 157187 

Sbjct 157247 



Gaps = 0/103 (0%) 



GGTGGCCGGACTGTGCCAT 3 l l T T31 ^TGCGGGGACC 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMM 

GGTGGCCGGACTGTGCCATGCTGCTCCGGACCCAAGGTGGCAGCTCTCACTGCAGGGACC 
CTGCTACTTCTGACAGCCATCGGGGCGGCATCCTGGGCCATTG 363 

: ::: 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CTGCTACTTCTGACAGCCATCGGGG" 1 - i 3 -TT3 157289 



Score = 132 bits 
Identities = 73/74 
Strand=Plus/Plus 



Score = 75.0 bits 
Identities = 42/4 
Strand=Plus/Plus 



GGTCCCACCCTGGCCCAGGAGGTCAGCCAGGGAATCATTAACAAGAGGCAGTGACATGGC 

: :::: 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 mmmmm 

GGTCCCACCCTGGCCCAGGAGGTCAGCCAGGGAATCATTAACAAGAGGCGGTGACATGGC 



GTGGCTGTTCTCCTCAGGAGTGACCAGGAGCCGCTGTACCCAG 405 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 I [ 1 1 1 i 1 1 1 1 1 1 Mill 
>ref|XM_001719305.1| PREDICTED: Homo sapiens hypothetical protein LOC100128675 (LOC100128675), 
mRNA 

Length=1287 

Score = 207 bits i 
Identities = 112/1] 
Strand=Plus/Minus 



Query 463 GGCGGCTGCTGTGCTCCTCGCGCTCCAACGCCAGGGTAGCCGGACTCAGCTG 514 
Sbjct 453 GGCGGCTGCTGTGCTCCTCGCGCTCCAACGCCAG 402 

>ref|XM_001721961.1| PREDICTED: Homo sapiens hypothetical protein LOC100128675 (LOC100128675), 
mRNA 

Length=1287 

Score = 207 bits (112), Expect = 1e-49 
Identities = 112/112 (100%), Gaps = 0/112 (0%) 
Strand=Plus/Minus 

Query 403 CAGTGCAGGTCAGCTCTGCGGACGCTCGGCTCATGGTCTTTGACAAGACGGAAGGGACGT 462 

II MM MM MM III II 

Sbjct 513 CAGTGCAGGTCAGCTCTGCGGACGCTCGGCTCATGGTCTTTGACAAGACGGAAGGGACGT 454 
Query 463 GGCGGCTGCTGTGCTCCTCGCGCTCCAACGCCAGGGTAGCCGGACTCAGCTG 514 
Sbjct 453 GGCGGCTGCTGTGCTCCT CGCG 402 

>ref|XM_001719287.1| PREDICTED: Homo sapiens hypothetical protein LOC100128675 (LOC100128675), 
mRNA 

Length=1287 

Score = 207 bits (112), Expect = 1e-49 
Identities = 112/112 (100%), Gaps = 0/112 (0%) 
Strand=Plus/Minus 

CAG1 

MM 

Sbjct 513 CAGTGCAGGTCAGCTCTGCGGACGCTCGGCTCATGGTCTTTGACAAGACGGAAGGGACGT 

>gb|AC158993.2| Mus musculus BAC clone RP24-427N13 from chromosome ~ 
sequence 
Length=179746 






Identities 4 = 12 
Strand=Plus/Minus 



1152 GGCCAACAGGCCGGGGTACTCCAGGAGGCTCGAGTCCCCATAATCAGCAATGATGTCTGC 1211 

MMMMMI Ml MMI MMI M M MMI I I MMI M I I Ml 

61803 GGCCAACAGGCTATGGTGCTCCAAGAGGCCCGGGTTCCCATCATAAGCAACGAAGTTTGC 61744 

61743 61684 
1272 GAGGGTGGCATTGATGCCTGCCAGG 1296 
GAGGGTGGCATTGATGCGTGCCAGG 61659 



61683 



71347 GGTGGCCGGACTGCAG-CATGCTGCTCCAGACCCAAcUtGGCAGCTCTC 

320 CCTGCTACTTC-TGACAGCCATCGGGGCGGCATCCTGGGCCATTG 363 

llllll III llllll III Mill II MMI 
71288 CCTGCTG-TTCCTGACAGGCATTGGGGCCGCGTCCTGGGCCATTG 71245 
Database: All GenBank+EMBL+DDBJ+PDB sequences (but no EST, STS, GSS, environmental 
samples or phase 0, 1 or 2 HTGS sequences) 
Posted date: May 19, 2008 5:44 PM 
Number of letters in database: -2,000,849,822 
Number of sequences in database: 6,839,787 

Lambda K H 



ful < 



.- of t 



;han 10 without gapping: 
Number of HSP's gapped: 16 
Number of HSP's successfully gapped: 16 
Length of query: 1783 
Length of database: 23768953950 
Length adjustment: 33 
Effective length of query: 1750 
Effective length of database: 23543240979 
Effective search space: 41200671713250 
Effective search space used: 41200671713250 
A: 0 



XI : 



(28 



8 bits) 
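
The effective-length figures in the summary above are consistent with simple arithmetic on the reported lengths. The following is a minimal sketch, assuming the length adjustment is subtracted once from the query and once from every database sequence; the variable names are illustrative and are not part of the BLAST output.

```python
# Values copied from the report summary.
query_length = 1783
db_letters = 23_768_953_950
db_sequences = 6_839_787
length_adjustment = 33

# Assumption: the adjustment is removed from the query once
# and from each of the database sequences once.
effective_query = query_length - length_adjustment
effective_db = db_letters - db_sequences * length_adjustment
search_space = effective_query * effective_db

print(effective_query)  # 1750
print(effective_db)     # 23543240979
print(search_space)     # 41200671713250
```

All three results match the "Effective length of query", "Effective length of database" and "Effective search space" lines of the report.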